This paper considers adiabatic reduction in a model of stochastic
gene expression with bursting transcription. We prove that an
adiabatic reduction can be performed in a stochastic slow/fast system
with a jump Markov process. In the gene expression model, the
production of mRNA (the fast variable) is assumed to follow a compound
Poisson process (the phenomenon called bursting in molecular
biology) and the production of protein (the slow variable) is linear as
a function of mRNA. When the dynamics of mRNA is assumed to be
faster than the protein dynamics (due to an mRNA degradation rate
larger than that of the protein) we prove that, with the appropriate
scaling, the bursting phenomenon can be transmitted to the slow variable.
We show that the reduced equation is either a stochastic differential
equation with a jump Markov process or a deterministic ordinary
differential equation, depending on the scaling that is appropriate.
These results are significant because adiabatic reduction techniques
seem not to have been developed for a stochastic differential system
containing a jump Markov process. We hope that they are generalizable
to give insight into adiabatic methods in more general stochastic
hybrid systems. In particular, a similar proof gives us a mathematical
justification of the bursting of mRNA through an appropriate scaling
of an "ON-OFF" switching gene model. Last but not least, for our
particular system, the adiabatic reduction allows us to understand
the necessary conditions for burst-like production of
mRNA and protein to occur.

1. Introduction. The adiabatic reduction technique gives results that
allow one to reduce the dimension of a system and justifies the use of an
effective set of reduced equations in lieu of dealing with a full,
higher-dimensional model if different time scales occur in the system.
Adiabatic reduction results for deterministic systems of ordinary
differential equations have been available since the very precise results
of Tikhonov (1952) and Fenichel (1979). The simplest results, in the
hyperbolic case, give an effective construction of a uniformly
asymptotically stable slow manifold (and hence
a reduced equation) and prove the existence of an invariant manifold near
the slow manifold, with (theoretically) any order of approximation of this
invariant manifold. Such precise and geometric results have been generalized
to random systems of stochastic differential equations with Gaussian white
noise (Berglund and Gentz (2006), see also Gardiner (1985) for previous
work on the Fokker-Planck equation). However, to the best of our
knowledge, analogous results for stochastic differential equations with
jumps have not been obtained.

The present paper gives a theoretical justification of an adiabatic reduc- 
tion of a particular piecewise deterministic Markov process (Davis, 1984). 
The results we obtain do not give a bound on the error of the reduced sys- 
tem, but they do allow us to justify the use of a reduced system in the case 
of a piecewise deterministic Markov process. In that sense, the results are 
close to the recent ones by Crudu et al. (2012) and Kang and Kurtz (2012), 
where general convergence results for discrete models of stochastic reaction 
networks are given. In particular, these papers give alternative scaling of 
the traditional ordinary differential equation and the diffusion approxima- 
tion depending on the different scaling chosen (see Ball et al. (2006) for some 
examples in a reaction network model). After the scaling, the limiting mod- 
els can be deterministic (ordinary differential equation), stochastic (jump 
Markov process), or hybrid (piecewise deterministic process). For illustra- 
tive and motivating examples given by a simulation algorithm, see Haseltine 
and Rawlings (2002); Rao and Arkin (2003); Goutsias (2005). 

Our particular model is meant to describe stochastic gene expression with 
explicit bursting (Friedman, Cai and Xie, 2006). The variables evolve under 
the action of a continuous deterministic dynamical system interrupted by 
positive jumps of random sizes that model the burst production. In that 
sense, the convergence theorems we obtain in this paper can be seen as an 
example in which there are an infinite number of reactions (one for each 
"burst size") and give complementary results to those of Crudu et al. (2012) 
and Kang and Kurtz (2012). We hope that the results here are generalizable 
to give insight into adiabatic reduction methods in more general stochastic 
hybrid systems (Hespanha, 2006; Bujorianu and Lygeros, 2004). We note 
also that more geometrical approaches have been proposed to reduce the 
dimension of such systems in Bujorianu and Katoen (2008). 

Biologically, the bursting of mRNA or protein is defined as the production
of several molecules within a very short time, indistinguishable within
the time scale of the measurement. In the biological context of models of
stochastic gene expression, explicit models of bursting mRNA and/or protein 
production have been analyzed recently, either using a discrete (Shahrezaei 
and Swain, 2008; Lei, 2009) or a continuous formalism (Friedman, Cai and 
Xie, 2006; Mackey, Tyran-Kaminska and Yvinec, 2011) as more and more 
experimental evidence from single-molecule visualization techniques has re- 
vealed the ubiquitous nature of this phenomenon (Ozbudak et al., 2002; 
Golding et al., 2005; Raj et al., 2006; Elf, Li and Xie, 2007; Xie et al., 2008; 
Raj and van Oudenaarden, 2009; Suter et al., 2011). Traditional models of 
gene expression are composed of at least two variables (mRNA and protein, 
and sometimes the DNA state). The use of a reduced one-dimensional model 
(that has the advantage that it can be solved analytically) has been justified 
so far by an argument concerning the stationary distribution in Shahrezaei 
and Swain (2008). However, it is clear that two different models may have 
the same stationary distribution but very different behavior (for an example 
in that context, see Mackey, Tyran-Kaminska and Yvinec (2011)). Hence, 
our results are of importance to rigorously prove the validity of using a 
reduced model. 

1.1. Continuous-state bursting model. The models referred to above have
explicitly assumed the production of several molecules instantaneously, 
through a jump Markov process, in agreement with experimental obser- 
vations. In line with experimental observations, it is standard to assume 
a Markovian hypothesis (an exponential waiting time between production 
jumps) and that the jump sizes are exponentially distributed (geometrically 
in the discrete case) as well. The intensity of the jumps can be a bounded 
function to allow for self-regulation. The assumption that the intensity is 
bounded is crucial, and makes the stochastic production term a compound 
Poisson process. 

Let X and Y denote the concentrations of mRNA and protein respectively. 
A simple model of single gene expression with bursting in transcription is 
given by 

$$\frac{dX}{dt} = -\gamma_1 X + \mathring{N}(h, \varphi(Y)), \tag{1}$$

$$\frac{dY}{dt} = -\gamma_2 Y + \lambda_2 X. \tag{2}$$

Here $\gamma_1$ and $\gamma_2$ are the degradation rates for the mRNA and protein
respectively, $\lambda_2$ is the mRNA translation rate, and $\mathring{N}(h, \varphi(Y))$ describes the
transcription that is assumed to be a compound Poisson white noise occurring
at a rate $\varphi$ with a non-negative jump size $\Delta X$ distributed with density
$h$.

The equations (1)-(2) are a shorthand notation for

$$X_t = X_0 - \int_0^t \gamma_1 X_{s^-}\,ds + \int_0^t\!\int_0^\infty\!\int_0^\infty \mathbf{1}_{\{r \le \varphi(Y_{s^-})\}}\, z\, N(ds,dz,dr), \tag{3}$$

$$Y_t = Y_0 - \int_0^t \gamma_2 Y_s\,ds + \int_0^t \lambda_2 X_s\,ds, \tag{4}$$

where $X_{s^-} = \lim_{t \to s^-} X(t)$, and $N(ds,dz,dr)$ is a Poisson random measure
on $(0,\infty) \times [0,\infty)^2$ with intensity $ds\,h(z)\,dz\,dr$, where $s$ denotes the times of
the jumps, $r$ is the state-dependency in an acceptance/rejection fashion, and
$z$ the jump size. Note that $(X_t)$ is a stochastic process with almost surely
finite variation on any bounded interval $(0,T)$, so that the last integral is
a well-defined Stieltjes integral.

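The following discussion is valid for general rate functions $\varphi$ and density
functions $h(\Delta X)$ that satisfy

$$\varphi \in C^1,\ \varphi \text{ and } \varphi' \text{ are bounded, i.e. } \varphi \le \bar\varphi,\ \varphi' \le \bar\varphi', \tag{5}$$

$$h \text{ is a density on } (0,\infty) \text{ and } \int_0^\infty x^n h(x)\,dx < \infty,\quad \forall n \ge 1. \tag{6}$$

For a general density function $h$, the average burst size is given by

$$b = \int_0^\infty x\,h(x)\,dx. \tag{7}$$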

Remark 1. Hill functions are often used to model gene self-regulation,
so that $\varphi$ is a Hill function of $Y$ with positive parameters and a
positive integer exponent $n$ (see Mackey, Tyran-Kamińska and Yvinec (2011)
for more details). An exponential distribution of the bursting
transcription is often used in modeling gene expression, in accordance
with experimental findings, so that the density function
$h$ is given by

$$h(\Delta x) = \frac{1}{b}\, e^{-\Delta x / b}$$

with $b$ the average burst size.

If $\varphi$ is independent of the state $Y$, the average transcription rate is $\lambda_1 = b\varphi$,
and the average mRNA and protein concentrations are

$$X_{eq} = \frac{b\varphi}{\gamma_1}, \tag{8}$$

$$Y_{eq} = \frac{\lambda_2}{\gamma_2} X_{eq} = \frac{b\varphi\lambda_2}{\gamma_1\gamma_2}. \tag{9}$$

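1.2. Statement of the results. In the following discussion, we consider the
situation when mRNA degradation is a fast process, i.e. $\gamma_1$ is 'large enough',
but the average protein concentration $Y_{eq}$ remains unchanged. Thus, we
always assume one of the following three scaling relations:

(S1) $\varphi/\gamma_1$ is independent of $\gamma_1$ ($= Y_{eq}\frac{\gamma_2}{b\lambda_2}$ if $\varphi$ is a constant) as $\gamma_1 \to \infty$ and
$h$, $\lambda_2$ remain unchanged; or

(S2) $h \to \frac{1}{\gamma_1} h\!\left(\frac{\cdot}{\gamma_1}\right)$ (so that $b = Y_{eq}\frac{\gamma_1\gamma_2}{\varphi\lambda_2}$) as $\gamma_1 \to \infty$ and $\varphi$, $\lambda_2$ remain
unchanged; or

(S3) $\lambda_2/\gamma_1$ is independent of $\gamma_1$ ($= Y_{eq}\frac{\gamma_2}{b\varphi}$ if $\varphi$ is a constant) as $\gamma_1 \to \infty$
and $h$, $\varphi$ remain unchanged.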
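
The three scaling relations can be checked numerically in the constant-rate case. The sketch below is an illustration only: the class `Params`, the helper names, and the parameter values are assumptions introduced here, and $\varphi$ is taken constant so that the equilibrium formulas $X_{eq} = b\varphi/\gamma_1$ and $Y_{eq} = b\varphi\lambda_2/(\gamma_1\gamma_2)$ apply. Each scaling changes $\gamma_1$ together with one other parameter so that $Y_{eq}$ is unchanged.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Params:
    gamma1: float   # mRNA degradation rate
    gamma2: float   # protein degradation rate
    lambda2: float  # translation rate
    phi: float      # burst rate (assumed constant here)
    b: float        # mean burst size


def x_eq(p: Params) -> float:
    """Average mRNA concentration b*phi/gamma1."""
    return p.b * p.phi / p.gamma1


def y_eq(p: Params) -> float:
    """Average protein concentration b*phi*lambda2/(gamma1*gamma2)."""
    return p.b * p.phi * p.lambda2 / (p.gamma1 * p.gamma2)


def rescale(p: Params, gamma1: float, scaling: str) -> Params:
    """Return parameters with a new gamma1 under scaling S1, S2 or S3."""
    ratio = gamma1 / p.gamma1
    if scaling == "S1":   # burst rate grows like gamma1
        return replace(p, gamma1=gamma1, phi=p.phi * ratio)
    if scaling == "S2":   # burst sizes grow like gamma1
        return replace(p, gamma1=gamma1, b=p.b * ratio)
    if scaling == "S3":   # translation rate grows like gamma1
        return replace(p, gamma1=gamma1, lambda2=p.lambda2 * ratio)
    raise ValueError(f"unknown scaling: {scaling}")


base = Params(gamma1=1.0, gamma2=0.5, lambda2=2.0, phi=3.0, b=1.5)
for name in ("S1", "S2", "S3"):
    fast = rescale(base, 100.0, name)
    print(name, "Y_eq =", y_eq(fast), "X_eq =", x_eq(fast))
```

In this illustration $Y_{eq}$ is the same for all three scalings, while the mean mRNA level $X_{eq}$ is preserved under (S1) and (S2) and decreases like $1/\gamma_1$ under (S3).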

In this paper we determine an effective reduced equation for equation (2)
for each of the three scaling conditions (S1)-(S3). In particular, we show
that under assumption (S1), equation (2) can be approximated by the
deterministic ordinary differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \lambda_2 \psi(Y) \tag{10}$$

where

$$\psi(Y) = b\varphi(Y)/\gamma_1. \tag{11}$$

We further show that under the scaling relations (S2) or (S3), equation (2)
can be reduced to the stochastic differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \mathring{N}(\bar h(\Delta Y), \varphi(Y)), \tag{12}$$

where $\bar h$ is a suitable density function in the jump size $\Delta Y$ (to be detailed
below).

We first explain, using some heuristic arguments, the differences between
the three scaling relations and the associated results. When $\gamma_1 \to \infty$,
applying a standard quasi-equilibrium assumption we have

$$\frac{dX}{dt} \approx 0,$$

which yields

$$X(t) \approx \frac{1}{\gamma_1}\mathring{N}(h(\cdot), \varphi(Y)) = \mathring{N}(\gamma_1 h(\gamma_1 \cdot), \varphi(Y)) \tag{13}$$

and therefore the second equation (2) becomes

$$\frac{dY}{dt} \approx -\gamma_2 Y + \mathring{N}\!\left(\frac{\gamma_1}{\lambda_2} h\!\left(\frac{\gamma_1}{\lambda_2}\,\cdot\right), \varphi(Y)\right).$$

Hence in (12), $\bar h(\Delta Y) = (\lambda_2/\gamma_1)^{-1} h((\lambda_2/\gamma_1)^{-1}\Delta Y)$ under the scalings (S2)
and (S3). Furthermore, we note that the scaling (S2) also implies $\gamma_1 h(\gamma_1 \cdot) = h(\cdot)$,
and therefore we can also write $\bar h(\Delta Y) = (\lambda_2)^{-1} h((\lambda_2)^{-1}\Delta Y)$ under
the scaling (S2). Finally, it is intuitively understandable that scaling (S1)
gives (10), as the jumps become more frequent and smaller.

In this paper, we provide three different proofs of the results mentioned 
above. In particular, we prove the results using a Master equation approach 
(the Kolmogorov forward equation) as well as starting from the stochastic 
differential equation. Note that both techniques have been used in the past, 
in particular within the context of discrete models of stochastic reaction 
networks. For the Master equation approach, see Haseltine and Rawlings 
(2005); Zeron and Santillan (2010) while for the stochastic differential equa- 
tion approach, we refer to Crudu et al. (2012); Kang and Kurtz (2012). 

1.3. Outline of the paper. This paper is organized as follows. In Section 
2, we consider the situation without auto-regulation, so the rate $\varphi$ is independent
of the protein concentration $Y$. In this case the two equations (1)-(2)
form a set of linear stochastic differential equations. We use the method of 
characteristic functionals to give a very short proof of the adiabatic reduc- 
tion when the white noise process is independent of the stochastic variables 
(external regulation). In Section 3 we consider the situation in which the 
rate $\varphi$ is dependent on the protein concentration $Y$. We first give a result
on the evolution equation on densities (Section 3.2), then we prove rigor- 
ously a weak convergence of the stochastic process using the machinery of 
generators (Section 3.3). Finally, in Section 4, we apply the same formalism 
with generators to prove that a binary "ON-OFF" model converges to a 
bursting model, thus illustrating that these techniques can be applicable in 
other contexts. 

2. The case without auto-regulation. First, we consider the simple
case with no auto-regulation in the mRNA transcription, so the rate $\varphi$ is
independent of the protein concentration $Y$. In this case, equation (1) is
independent of (2), which makes the analysis easier.

Consider the equations

$$\frac{dX}{dt} = -\gamma_1 X + \mathring{N}(h, \varphi), \qquad X(0) = X_0 > 0, \tag{14}$$

$$\frac{dY}{dt} = -\gamma_2 Y + \lambda_2 X, \qquad Y(0) = Y_0 > 0, \tag{15}$$

where $\mathring{N}(h, \varphi)$ is a compound Poisson white noise, and $\varphi$ is a constant.
The solutions $X(t)$ and $Y(t)$ of (14)-(15) are stochastic processes uniquely
determined by the equation parameters and the stochastic process $\mathring{N}$.
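
To make the linear system (14)-(15) concrete, the following sketch simulates the state at a fixed time exactly: between bursts the deterministic flow has a closed form, and with constant $\varphi$ the waiting times between bursts are exponential. The exponential burst size density of Remark 1, the function names, and the parameter values in the usage lines are illustrative assumptions, not part of the paper's argument; the closed-form flow also assumes $\gamma_1 \ne \gamma_2$.

```python
import math
import random


def flow(x, y, dt, gamma1, gamma2, lambda2):
    """Exact solution of dX/dt = -gamma1*X, dY/dt = -gamma2*Y + lambda2*X
    over a time step dt without jumps (requires gamma1 != gamma2)."""
    e1 = math.exp(-gamma1 * dt)
    e2 = math.exp(-gamma2 * dt)
    x_new = x * e1
    y_new = y * e2 + lambda2 * x * (e1 - e2) / (gamma2 - gamma1)
    return x_new, y_new


def simulate(t_end, x0, y0, gamma1, gamma2, lambda2, phi, b, rng):
    """Return (X(t_end), Y(t_end)) for one sample path of (14)-(15).

    Bursts arrive at constant rate phi; burst sizes are assumed
    exponential with mean b (an illustrative choice of the density h).
    """
    t, x, y = 0.0, x0, y0
    while True:
        wait = rng.expovariate(phi)          # time to the next burst
        if t + wait >= t_end:
            return flow(x, y, t_end - t, gamma1, gamma2, lambda2)
        x, y = flow(x, y, wait, gamma1, gamma2, lambda2)
        x += rng.expovariate(1.0 / b)        # add a burst of mRNA
        t += wait


rng = random.Random(0)
samples = [simulate(5.0, 0.4, 2.0, 5.0, 1.0, 5.0, 2.0, 1.0, rng)[1]
           for _ in range(2000)]
print("empirical mean of Y(5):", sum(samples) / len(samples))
```

With these hypothetical values, $Y_{eq} = b\varphi\lambda_2/(\gamma_1\gamma_2) = 2$, and since the initial condition sits at the equilibrium means, the empirical mean of $Y$ should stay close to 2.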

We first state the main results for each of the scaling relations we have
considered.

Remark 2. In all scaling situations that we consider in the three theorems
below, the stochastic process $X_t$ converges only in finite-dimensional
law. Here finite-dimensional law means that for any finite number of fixed
times $(t_1, t_2, \dots, t_n)$ the random variable $(X(t_1), X(t_2), \dots, X(t_n))$ converges
in law towards the appropriate distribution. Convergence in finite-dimensional
law is weaker than convergence in law of the stochastic process. We will see
in the proofs of the theorems that this is because of a lack of compactness for
the process $X(t)$.

Theorem 1. Consider the equations (14)-(15). If the scaling (S1) is
satisfied, i.e., $\varphi = Y_{eq}\frac{\gamma_1\gamma_2}{b\lambda_2}$, then when $\gamma_1 \to \infty$,

1. The stochastic process $X(t)$ converges in finite-dimensional law to the
(deterministic) steady-state value $X_{eq}$;

2. The stochastic process $Y(t)$ converges in law towards the deterministic
solution of the ordinary differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \lambda_2 X_{eq}, \qquad Y(0) = Y_0 > 0. \tag{16}$$

Theorem 2. Consider the equations (14)-(15). If the scaling (S2) is
satisfied, i.e., $h \to \frac{1}{\gamma_1} h\!\left(\frac{\cdot}{\gamma_1}\right)$ (so that $b = Y_{eq}\frac{\gamma_2\gamma_1}{\varphi\lambda_2}$), then when $\gamma_1 \to \infty$,

1. The stochastic process $X(t)$ converges in finite-dimensional law to the
compound Poisson white noise $\mathring{N}(h, \varphi)$;

2. The stochastic process $Y(t)$ converges in law to the stochastic process
defined by the solution of the stochastic differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \mathring{N}(\bar h, \varphi), \qquad Y(0) = Y_0 > 0, \tag{17}$$

where $\bar h(\Delta Y) = (\lambda_2)^{-1} h((\lambda_2)^{-1}\Delta Y)$ is the density function for the
jump size $\Delta Y$.

Theorem 3. Consider the equations (14)-(15). If the scaling (S3) is
satisfied, i.e., $\lambda_2 = Y_{eq}\frac{\gamma_2\gamma_1}{b\varphi}$, then when $\gamma_1 \to \infty$,

1. The stochastic process $X(t)$ converges in finite-dimensional law to the
(deterministic) fixed value 0;

2. The stochastic process $Y(t)$ converges in law to the stochastic process
determined by the solution of the stochastic differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \mathring{N}(\bar h, \varphi), \qquad Y(0) = Y_0 > 0, \tag{18}$$

where $\bar h(\Delta Y) = (\lambda_2/\gamma_1)^{-1} h((\lambda_2/\gamma_1)^{-1}\Delta Y)$ is the density function for
the jump size $\Delta Y$.

Remark 3. Note that scalings (S2) and (S3) give similar results for
the equation governing the protein variable $Y_t$ but very different results
for the asymptotic stochastic process related to the mRNA. In particular,
in Theorem 2, very large bursts of mRNA are transmitted to the protein,
whereas in Theorem 3, mRNA is only very rarely present, but when present it is
efficiently translated into a burst of protein.

These theorems will be proved using the method of characteristic
functionals. For a stochastic process $\xi_t$ ($t \ge 0$), the characteristic functional
$C_\xi : S \to \mathbb{C}$ is defined as

$$C_\xi[f] = \mathbb{E}\left[\exp\left(i\int_0^\infty f(t)\,\xi_t\,dt\right)\right] \tag{19}$$

for any function $f$ in a suitable function space $S$ so that the integral
$\int_0^\infty f(t)\,\xi_t\,dt$ is well defined. Before continuing, we need to introduce some
topological background as well as properties of the Fourier transform in
nuclear spaces (see Gel'fand and Vilenkin (1964)).

2.1. Stochastic process as a distribution. We are going to recall here the
continuous correspondence between a stochastic process and a distribution.
All stochastic processes we consider here are right continuous with finite left
limits. It is natural to define a stochastic process as a random element in the
space $\mathcal{D} = D((0,\infty), \mathbb{R})$, the space of all real-valued functions on $(0,\infty)$
that are right continuous with finite left limits. Such a space is classically
endowed with the Skorokhod topology (see for instance Jacod and Shiryaev
(1987, Chap. 6)). We define $D(\mathbb{R}^+)$, the space of smooth functions with
compact support, with the inductive limit topology given by the family of
semi-norms ($k = 0, 1, 2, \dots$) $p_k(f) = \sup |f^{(k)}|$ on every $D([0,n])$, $n \in \mathbb{N}$ (cf.
(Schaefer, 1971, Example 2, page 57)). Let $f \in D(\mathbb{R}^+)$, and define $\tilde x$ in the
dual space $D'(\mathbb{R}^+)$ such that

$$\tilde x(f) = \int_0^\infty x(t) f(t)\,dt. \tag{20}$$

Lemma 4. The map

$$D((0,\infty), \mathbb{R}) \to D'(\mathbb{R}^+), \qquad (x_t)_{t \ge 0} \mapsto \tilde x, \tag{21}$$

where $\tilde x$ is defined by equation (20), is continuous.

Proof. It is a classical result that $x \in \mathcal{D}$ has at most a countable number
of discontinuity points, so that $x$ is locally integrable, the integral in Eq. (20)
is well defined for all $f \in D(\mathbb{R}^+)$, $\tilde x \in D'(\mathbb{R}^+)$ and

$$|\tilde x(f)| \le \left(\int_0^T |x(s)|\,ds\right) \|f\|_\infty$$

for any $f$ with support in $[0, T]$ (Rudin, 1991, Section 6.11, page 142). We
conclude by noticing that

$$|\tilde x(f)| \le \sup_{s \le T} |x(s)| \; \|f\|_\infty \, T$$

and $x \mapsto \sup_{s \le T} |x(s)|$ is continuous for the Skorokhod topology (Jacod
and Shiryaev, 1987, Proposition 2.4, page 339) for all $T$ such that $T$ is not
a discontinuity point. □

2.2. Bochner-Minlos theorem for a nuclear space. Let $E$ be a nuclear
space. We state a key result that will allow us to uniquely identify a measure
on the dual $E'$ of $E$.

Bochner-Minlos Theorem. (Gel'fand and Vilenkin, 1964, Theorem
2, page 146) For a continuous functional $C$ on a nuclear space $E$ that
satisfies $C(0) = 1$, and for any complex $z_j$ and elements $x_j \in E$, $j, k = 1, \dots, n$,

$$\sum_{j=1}^n \sum_{k=1}^n z_j \bar z_k\, C(x_j - x_k) \ge 0, \tag{22}$$

there is a unique probability measure $\mu$ on the dual space $E'$, given by

$$C(y) = \int_{E'} e^{i\langle x, y\rangle}\,d\mu(x). \tag{23}$$

Note that the space $D(\mathbb{R}^+)$ is a nuclear space (Schaefer, 1971, Example
2, page 107).

2.3. The characteristic functional of a Poisson white noise. The use of
the characteristic functional allows us to define a generalized stochastic
process that does not necessarily have a trajectory in the usual sense (like in
$\mathcal{D}$ for instance). Indeed a (compound) Poisson white noise is seen as a
random measure on the distribution space associated with the characteristic
functional (given in Hida and Si (2004), here $f \in D(\mathbb{R}^+)$)

$$C_N[f] = \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i f(t) z} - 1\right) h(z)\,dz\,dt\right], \tag{24}$$

where $\varphi$ is the Poisson intensity and $h$ the jump size distribution. It is not
hard to see that $C_N[f - g]$ and $C_N[g - f]$ are conjugate to each other,
$C_N[0] = 1$ and $C_N[\cdot]$ is continuous for $h \in L^1(\mathbb{R}^+)$, so the conditions in the
Bochner-Minlos Theorem 2.2 are satisfied and therefore $C_N$ uniquely defines
a measure on $D'(\mathbb{R}^+)$.

We refer to Prokhorov (1956); Hida and Si (2004, 2008) for further
material on characteristic functionals and generalized stochastic processes.
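
The properties of the characteristic functional quoted above can be illustrated numerically. The sketch below assumes the exponential jump density $h(z) = e^{-z/b}/b$ of Remark 1, for which the inner integral $\int_0^\infty (e^{iaz} - 1)h(z)\,dz$ equals $iab/(1 - iab)$; the function names, the bump test function, and the quadrature settings are all illustrative choices.

```python
import cmath
import math


def inner_closed_form(a, b):
    """Integral of (exp(i*a*z) - 1) * h(z) dz for exponential h with mean b."""
    return 1j * a * b / (1 - 1j * a * b)


def inner_quadrature(a, b, z_max_factor=40.0, n=20000):
    """Same integral by the midpoint rule on [0, z_max_factor * b]."""
    dz = z_max_factor * b / n
    total = 0j
    for k in range(n):
        z = (k + 0.5) * dz
        total += (cmath.exp(1j * a * z) - 1) * math.exp(-z / b) / b
    return total * dz


def bump(t):
    """A smooth test function with compact support in [0, 1]."""
    return math.exp(-1.0 / (t * (1.0 - t))) if 0.0 < t < 1.0 else 0.0


def log_char_functional(f, phi, b, t_max=1.0, n=2000):
    """log C_N[f] for f supported in [0, t_max], midpoint rule in t."""
    dt = t_max / n
    return phi * dt * sum(inner_closed_form(f((k + 0.5) * dt), b)
                          for k in range(n))


log_c = log_char_functional(bump, phi=2.0, b=1.5)
print("C_N[f] =", cmath.exp(log_c))
print("quadrature vs closed form:",
      inner_quadrature(0.7, 1.3), inner_closed_form(0.7, 1.3))
```

In this setting $C_N[0] = 1$, $C_N[-f]$ is the complex conjugate of $C_N[f]$, and $|C_N[f]| \le 1$, in line with the properties used to invoke the Bochner-Minlos theorem.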

2.4. Proofs of Theorems 1 through 3. The proofs of Theorems 1 to 3
are based on the idea of Lévy's continuity theorem. However, in the
infinite-dimensional case, the convergence of the Fourier transform does not imply
convergence in law of the random variable, and one needs to impose more
restrictions, namely a compactness condition. We will use the following lemma.

Lemma 5. Let $X^n$ be a sequence of stochastic processes in $D((0,\infty), \mathbb{R})$.
Suppose $X^n$ is tight in $D((0,\infty), \mathbb{R})$ and that there exists a random variable
$X$ such that, for all $f$ in $D(\mathbb{R}^+)$, as $n \to \infty$,

$$C_{X^n}[f] \to C_X[f].$$

Then $X^n$ converges in law to $X$ in $D((0,\infty), \mathbb{R})$.

Proof. The convergence of the characteristic functional, the Bochner-Minlos
Theorem 2.2 and the continuity Lemma 4 ensure that the sequence
$X^n$ has at most one limiting law, which has to be the law of $X$. The
classical Prokhorov Theorem (Jacod and Shiryaev, 1987, Corollary 3.9, page
348) states that tightness of $X^n$ in $\mathcal{D} = D((0,\infty), \mathbb{R})$ is equivalent to relative
compactness of the law of $X^n$ in $\mathcal{P}(\mathcal{D})$, the space of probability measures on
$\mathcal{D}$ (with the topology of the weak convergence). Then $X^n$ converges in law
to $X$ in $D((0,\infty), \mathbb{R})$. □



Now, we give the proofs of Theorems 1 through 3. The process is similar 
for each, and we only present a detailed proof for Theorem 1 and sketch the 
main differences in the proofs for Theorems 2 and 3.

Proof of Theorem 1. For any $f \in D(\mathbb{R}^+)$, from (14)-(15) and noting that
the initial conditions $X_0$ and $Y_0$ are deterministic, it is not difficult to verify
that (see also Cáceres and Budini (1997))

$$C_X[f] = e^{iX_0 k_1}\, C_N[f_1(t)], \qquad C_Y[f] = e^{iY_0 k_2}\, C_X[\lambda_2 f_2(t)], \tag{25}$$

where

$$k_i = \int_0^\infty e^{-\gamma_i s} f(s)\,ds, \qquad f_i(t) = \int_t^\infty e^{-\gamma_i (s-t)} f(s)\,ds, \qquad (i = 1, 2).$$

Note that for any function $f \in D(\mathbb{R}^+)$ the functions $f_i(t)$ also belong to
$D(\mathbb{R}^+)$ and therefore the characteristic functionals in (25) are well-defined.
Furthermore, the characteristic functional $C_N$ of the compound Poisson white
noise has been derived in Equation (24).

Now, we are ready to complete the proof by calculating the characteristic
functionals $C_X$ and $C_Y$ when $\gamma_1 \to \infty$ from (25) and (24). First, we note
that $k_i = f_i(0)$, and when $f \in D(\mathbb{R}^+)$ and $\gamma_1 \to \infty$,

$$f_1(t) = \frac{1}{\gamma_1} f(t) + O\!\left(\frac{1}{\gamma_1^2}\right). \tag{26}$$
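Furthermore, from (25)-(26), we have

$$C_X[f] = e^{iX_0 k_1}\, C_N\!\left[\frac{1}{\gamma_1} f(t) + O\!\left(\frac{1}{\gamma_1^2}\right)\right] = e^{iX_0 k_1} \exp\left[i\varphi \int_0^\infty\!\!\int_0^\infty \frac{1}{\gamma_1} f(t)\, x\, h(x)\,dx\,dt + O\!\left(\frac{1}{\gamma_1}\right)\right]. \tag{27}$$

Thus, from the scaling (S1) and (7), we have

$$\lim_{\gamma_1 \to \infty} C_X[f] = \exp\left[i \int_0^\infty f(t)\, X_{eq}\,dt\right]. \tag{28}$$

Therefore, (25) yields

$$\lim_{\gamma_1 \to \infty} C_Y[f] = e^{iY_0 k_2} \exp\left[i \int_0^\infty \lambda_2 f_2(t)\, X_{eq}\,dt\right] = e^{iY_0 k_2} \exp\left[i \int_0^\infty f(s)\left(1 - e^{-\gamma_2 s}\right) Y_{eq}\,ds\right]. \tag{29}$$

Now, it is easy to verify that the right hand sides of (28) and (29) give,
respectively, the characteristic functional of $X(t) \equiv X_{eq}$ and of the
solution $Y(t)$ of (16). Thus, the process $X(t)$ of (14) converges in finite-dimensional
law to $X_{eq}$ (by Lévy's continuity theorem).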
Finally, to show that the process $Y(t)$ defined by (15) converges in law
towards the solution of (16), we apply Lemma 5 and only need to show that
a sequence $\{Y^n\}$ defined by $\{\gamma_{1,n}\}$ with $\gamma_{1,n} \to \infty$ is tight in $D((0,\infty), \mathbb{R})$.

Let $\{\gamma_{1,n}\}$ be a sequence of positive constants so that $\gamma_{1,n} \to \infty$ when
$n \to \infty$, and $\{X_t^n\}$ and $\{Y_t^n\}$ be two sequences of stochastic processes such
that $(X_t^n, Y_t^n)$ is determined by (14)-(15) with $\gamma_1$ replaced by $\gamma_{1,n}$.

For any $n$, let $N_n$ be a compound Poisson process with $\{T_{n,i}\}_{i \ge 1}$ the jump
times, which occur at a rate $\varphi_n = Y_{eq}\frac{\gamma_{1,n}\gamma_2}{b\lambda_2}$, and $\{Z_{n,i}\}_{i \ge 1}$ the jump sizes,
which are iid random variables with density $h$ (with the convention $T_{n,0} = 0$
and $Z_{n,0} = X_0$). Then

$$X_t^n = \sum_{t \ge T_{n,i}} Z_{n,i}\, e^{-\gamma_{1,n}(t - T_{n,i})}.$$

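From (4), we have

$$Y_t^n \le Y_0 + \int_0^t \lambda_2 X^n(s)\,ds \le Y_0 + \frac{\lambda_2}{\gamma_{1,n}} \sum_{T_{n,i} \le t} Z_{n,i} = Y_0 + \frac{\lambda_2}{\gamma_{1,n}} N_n(t).$$

Now,

$$N_n(t) = \tilde N_n(t) + \varphi_n t \int_0^\infty z\,h(z)\,dz = \tilde N_n(t) + \varphi_n b\,t,$$

where $\tilde N_n$ is a compensated compound Poisson process (that is, $\mathbb{E}[\tilde N_n] = 0$).
$\tilde N_n$ is a right continuous martingale. Then by Doob's inequalities (Ethier
and Kurtz, 2005, Corollary 2.17, page 64),

$$\mathbb{P}\left(\sup_{t \le T} |\tilde N_n(t)| \ge K\right) \le \frac{1}{K}\,\mathbb{E}\,|\tilde N_n(T)|.$$

Now let $T > 0$, $\varepsilon > 0$, and $K$ be such that $\mathbb{P}(|Y_0| > K/2) < \frac{\varepsilon}{2}$, so we have

$$\mathbb{P}\left(\sup_{t \le T} |Y_t^n| \ge K\right) \le \frac{\varepsilon}{2} + \frac{4}{K}\,\frac{\lambda_2 \varphi_n b\,T}{\gamma_{1,n}} \le \varepsilon \tag{30}$$

for $K$ sufficiently large, according to hypothesis (6) and the scaling (S1).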
Similarly, for any $t_1, t_2 \in [0, T]$,

$$|Y^n_{t_2} - Y^n_{t_1}| \le \frac{\lambda_2}{\gamma_{1,n}}\, |N_n(t_2) - N_n(t_1)|$$

and, for any $T > 0$, $\varepsilon > 0$, $\eta > 0$,

$$\mathbb{P}\left(|Y^n_{t_2} - Y^n_{t_1}| > \eta\right) < \varepsilon \tag{31}$$

for $|t_2 - t_1|$ sufficiently small. With (30)-(31) and by classical criteria for
tightness in $\mathcal{D}$ (Jacod and Shiryaev, 1987, Theorem 3.21, page 350), $\{Y^n\}$ is
tight in $D((0,\infty), \mathbb{R})$.

□



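Proof of Theorem 2. Let

$$h^{\gamma_1}(x) = \frac{1}{\gamma_1}\, h\!\left(\frac{x}{\gamma_1}\right).$$

The proof is similar to the proof of Theorem 1. Note simply from the scaling
(S2) that (27) becomes

$$C_X[f] = e^{iX_0 k_1} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i f(t) x/\gamma_1} - 1\right) h^{\gamma_1}(x)\,dx\,dt + O\!\left(\frac{1}{\gamma_1}\right)\right]. \tag{32}$$

Thus, we have

$$\lim_{\gamma_1 \to \infty} C_X[f] = \lim_{\gamma_1 \to \infty} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i f(t) x/\gamma_1} - 1\right) h\!\left(\frac{x}{\gamma_1}\right) d\!\left(\frac{x}{\gamma_1}\right) dt\right] = \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i f(t) x} - 1\right) h(x)\,dx\,dt\right].$$

Furthermore, from (25),

$$\lim_{\gamma_1 \to \infty} C_Y[f] = e^{iY_0 k_2} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i \lambda_2 x f_2(t)} - 1\right) h(x)\,dx\,dt\right] = e^{iY_0 k_2} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i x f_2(t)} - 1\right) \bar h(x)\,dx\,dt\right], \tag{33}$$

where

$$\bar h(x) = \lambda_2^{-1}\, h(\lambda_2^{-1} x).$$

It is easy to verify that (33) is just the characteristic functional of the
stochastic processes given by solutions of (17).

The rest of the proof is similar to the proof of Theorem 1 (Equations
(30)-(31) are still valid) and the details are omitted.

□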



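Proof of Theorem 3. Here, it is sufficient to note that $\lambda_2 = Y_{eq}\frac{\gamma_2\gamma_1}{b\varphi}$ in the
scaling (S3), and furthermore

$$\lim_{\gamma_1 \to \infty} C_X[f] = \lim_{\gamma_1 \to \infty} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i f(t) x/\gamma_1} - 1\right) h(x)\,dx\,dt\right] = 1 \tag{34}$$

and

$$\lim_{\gamma_1 \to \infty} C_Y[f] = e^{iY_0 k_2} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i (\lambda_2/\gamma_1) x f_2(t)} - 1\right) h(x)\,dx\,dt\right] = e^{iY_0 k_2} \exp\left[\varphi \int_0^\infty\!\!\int_0^\infty \left(e^{i x f_2(t)} - 1\right) \bar h(x)\,dx\,dt\right], \tag{35}$$

where

$$\bar h(x) = \left(\frac{\lambda_2}{\gamma_1}\right)^{-1} h\!\left(\left(\frac{\lambda_2}{\gamma_1}\right)^{-1} x\right).$$

□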

3. The case with auto-regulation. We now turn to a consideration
of the adiabatic reduction of equations (1)-(2) in the general case when the
jump rate $\varphi$ is dependent on protein concentration, $\varphi(Y)$.

We use two different methods to arrive at our results. The first method
is based on the density evolution equations, and shows that the evolution
equations obtained from the two stochastic differential equations are
consistent with each other when $\gamma_1 \to +\infty$. Existence of densities for such
processes has been rigorously proved in Tyran-Kamińska (2009); Mackey
and Tyran-Kamińska (2008). The second method is based on the
convergence of generators and shows that the stochastic process $Y(t)$ given by
equations (1)-(2) converges in distribution to that of the solution of
equation (12). Background on generators and convergence of stochastic processes
can be found in Ethier and Kurtz (2005), and generators for hybrid systems 
have been characterized in Davis (1984); Hespanha (2006); Bujorianu and 
Lygeros (2004). Though the second method implies the first, the two proofs 
using the two approaches are interesting and give different insights into the 
problem. 

We first summarize the important background results on the stochastic 
processes described in our paper. 

One dimensional equation. For the one-dimensional stochastic differential
equation (12) perturbed by a compound Poisson white noise, of (bounded)
intensity $\varphi(y)$ and jump size distribution $\bar h$, the infinitesimal generator of the
stochastic process $(Y_t)_{t \ge 0}$ is (Davis, 1984, Theorem 5.5), for any $f \in \mathcal{D}(A_1)$,

$$A_1 f(y) = -\gamma_2 y \frac{df}{dy} + \varphi(y)\left(\int_y^\infty \bar h(z - y) f(z)\,dz - f(y)\right), \tag{36}$$

$$\mathcal{D}(A_1) = \Big\{ f \in M(0,\infty) : t \mapsto f(y e^{-\gamma_2 t}) \text{ is absolutely continuous for } t \in \mathbb{R}^+ \text{ and } \mathbb{E}\sum_{T_i \le t} |f(Y_{T_i}) - f(Y_{T_i^-})| < \infty \text{ for all } t \ge 0 \Big\}, \tag{37}$$

where $M(0, \infty)$ denotes the Borel-measurable functions on $(0, \infty)$ and the times
$T_i$ are the instants of the jumps of $Y_t$. It is an extended domain containing all
functions that are sufficiently smooth along the deterministic trajectories
between the jumps, and with a bounded total variation induced by the
jumps.

The operator $A_1$ is the adjoint of the operator acting on densities $v(t,y)$
given by Mackey and Tyran-Kamińska (2008)

$$\frac{\partial v(t,y)}{\partial t} = \frac{\partial}{\partial y}\left[\gamma_2 y\, v(t,y)\right] + \int_0^y \varphi(z)\, v(t,z)\, \bar h(y - z)\,dz - \varphi(y)\, v(t,y). \tag{38}$$

For any $f \in \mathcal{D}(A_1)$, we have

$$\frac{d}{dt}\mathbb{E} f(Y_t) = \mathbb{E} A_1 f(Y_t). \tag{39}$$

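Two dimensional equation. Consideration of the two-dimensional stochastic
differential equation (1)-(2) perturbed by a compound Poisson white noise,
of intensity $\varphi(y)$ and jump size distribution $h$, follows along similar lines. Its
infinitesimal generator and extended domain are

$$A_2 g(x,y) = -\gamma_1 x \frac{\partial g}{\partial x} + (\lambda_2 x - \gamma_2 y)\frac{\partial g}{\partial y} + \varphi(y)\left(\int_x^\infty h(z - x)\, g(z,y)\,dz - g(x,y)\right), \tag{40}$$

$$\mathcal{D}(A_2) = \Big\{ g \in M((0,\infty)^2) : t \mapsto g(\phi_t(x,y)) \text{ is absolutely continuous for } t \in \mathbb{R}^+ \text{ and } \mathbb{E}\sum_{T_i \le t} |g(X_{T_i}, Y_{T_i}) - g(X_{T_i^-}, Y_{T_i^-})| < \infty \text{ for all } t \ge 0 \Big\}, \tag{41}$$

where $\phi_t$ is the deterministic flow given by equations (1)-(2).
The evolution equation for densities $u(t, x, y)$ is

$$\frac{\partial u(t,x,y)}{\partial t} = \frac{\partial}{\partial x}\left[\gamma_1 x\, u(t,x,y)\right] - \frac{\partial}{\partial y}\left[(\lambda_2 x - \gamma_2 y)\, u(t,x,y)\right] + \int_0^x \varphi(y)\, u(t,z,y)\, h(x - z)\,dz - \varphi(y)\, u(t,x,y). \tag{42}$$

For any $f \in \mathcal{D}(A_2)$, we have

$$\frac{d}{dt}\mathbb{E} f(X_t, Y_t) = \mathbb{E} A_2 f(X_t, Y_t). \tag{43}$$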

Note that from the results of Section 2, demonstrating convergence in law 
for the first variable Xt is impossible. Rather, the finite-dimensional conver- 
gence properties are replaced by scaling behavior of the marginal moment. 
As expected, this scaling depends on the hypotheses (S1),(S2) or (S3). This 
scaling behavior is crucial for the convergence results and is discussed in the 
following subsection before we prove the main results. 

3.1. Scaling of the marginal moment. Using the generator $A_2$ for the
two-dimensional stochastic process defined by (40)-(41), we can deduce the
scaling laws of the marginal moment of $X$ as $\gamma_1 \to \infty$.

Proposition 6. Let $(X(t), Y(t))$ be the solutions of (1)-(2), and $\mu_k(t) =
\mathbb{E}[X(t)^k]$ and $\nu_k(t) = \mathbb{E}[Y(t) X(t)^k]$. Suppose $\mu_k(0) < \infty$ and $\nu_k(0) < \infty$,
then $\mu_k(t) < \infty$ and $\nu_k(t) < \infty$ for all $t$. Moreover, for fixed $t > 0$,

1. If the scaling (S1) holds, then both $\mu_k(t)$ and $\nu_k(t)$ stay uniformly
bounded above and below as $\gamma_1 \to \infty$.

2. If the scaling (S2) holds, then, for $k \ge 1$,

$$\mu_k(t) \sim \gamma_1^{k-1}, \qquad \nu_k(t) \sim \gamma_1^{k-1}, \qquad (\gamma_1 \to \infty) \tag{44}$$

and $\nu_0(t)$ is uniformly bounded above and below as $\gamma_1 \to \infty$.

3. If (S3) holds, then, for $k \ge 1$,

$$\mu_k(t) \sim \gamma_1^{-1}, \qquad \nu_k(t) \sim \gamma_1^{-1}, \qquad (\gamma_1 \to \infty) \tag{45}$$

and $\nu_0(t)$ is uniformly bounded above and below as $\gamma_1 \to \infty$.

Proof. The proposition is proved using the evolution equation for the
marginal moment obtained from the generator $A_2$.

First, we claim that the functions $x^k$ and $x^k y$ ($\forall k \in \mathbb{N}^*$) are contained in
$\mathcal{D}(A_2)$. To show this, we only need to verify that

$$\mathbb{E}\sum_{T_i \le t} \left|X(T_i)^k Y(T_i)^l - X(T_i^-)^k Y(T_i^-)^l\right| < \infty, \qquad t > 0,\ k \in \mathbb{N}^*,\ l = 0, 1, \tag{46}$$

where the $T_i$ are jump times. Since $Y(t)$ is continuous and, from equation
(4), $\mathbb{E} Y_t \le \mathbb{E} Y_0 + \int_0^t \lambda_2\, \mathbb{E} X_s\,ds$, we only need to verify the case with $l = 0$.

Let $Z_i$ be the random variable associated with the $i$-th jump, and $N_t$ the
number of jumps for which $T_i \le t$. Then $X(T_i) = X(T_i^-) + Z_i$, and thus

$$\mathbb{E}\sum_{T_i \le t} \left|X(T_i)^k - X(T_i^-)^k\right| = \mathbb{E}\sum_{T_i \le t} \sum_{j < k} \binom{k}{j} X(T_i^-)^j Z_i^{k-j} \le (\mathbb{E} N_t)\, C(E h, E^2 h, \dots, E^k h) < \infty,$$

where $E^j h = \int_0^\infty x^j h(x)\,dx$ is the $j$-th moment of $h$, and $C(E h, E^2 h, \dots, E^k h)$ is
a polynomial in the first $k$ moments. Here we have used the independence
of the jump times and the jump sizes, as well as the upper bound of the
trajectories $X_t$ due to the following formal expression

$$X_t = \sum_{T_i \le t} H(t - T_i)\, Z_i\, e^{-\gamma_1 (t - T_i)},$$

where $H$ stands for the Heaviside step function.

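Now, $A_2 x^k$ and $A_2 x^k y$ are well defined. A straightforward calculation
yields

$$A_2 x^k = -\gamma_1 k x^k + \varphi(y)\left(\int_x^\infty h(z - x)(z - x + x)^k\,dz - x^k\right)$$
$$= -\gamma_1 k x^k + \varphi(y)\sum_{i=0}^{k-1} \binom{k}{i} x^i \int_x^\infty h(z - x)(z - x)^{k-i}\,dz$$
$$= -\gamma_1 k x^k + \varphi(y)\sum_{i=0}^{k-1} \binom{k}{i} x^i E^{k-i} h.$$

Then the $k$-th marginal moment $\mu_k(t)$ of the first variable $x$ depends only on
the lower moments $\mu_i(t)$, $i < k$. We then obtain, with assumption (5) and
equation (43), the upper bound

$$\frac{d\mu_k(t)}{dt} \le -\gamma_1 k\,\mu_k(t) + \bar\varphi \sum_{i=0}^{k-1} \binom{k}{i} \mu_i(t)\, E^{k-i} h, \tag{47}$$

together with a lower bound of the same form.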

Assume scaling (SI). Applying Gronwall's inequality to equation (47) for 
k = 1 yields, for alH > 0, 

^+0(4)<Mi(*)<^ + 0(4) 

71 7i 71 7i 
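The Gronwall step for $k = 1$ can be written out explicitly. The following is a sketch for the upper bound only, using $\mu_0(t) = 1$ and writing $\overline{\varphi}$ for an upper bound of $\varphi$ (notation assumed here; the lower bound is obtained the same way from a lower bound of $\varphi$):

```latex
% Assumed notation: \overline{\varphi} is an upper bound of \varphi (assumption (5)),
% and \mu_0(t) = 1 because the law of X has total mass one.
\begin{align*}
\dot{\mu}_1(t) &\le -\gamma_1 \mu_1(t) + \overline{\varphi}\,\mathbb{E}h, \\
\frac{d}{dt}\left(e^{\gamma_1 t}\mu_1(t)\right) &\le \overline{\varphi}\,\mathbb{E}h\; e^{\gamma_1 t}, \\
\mu_1(t) &\le \mu_1(0)\,e^{-\gamma_1 t}
  + \frac{\overline{\varphi}\,\mathbb{E}h}{\gamma_1}\left(1 - e^{-\gamma_1 t}\right).
\end{align*}
```

For fixed $t > 0$ the exponential terms decay faster than any power of $1/\gamma_1$, and under (S1) the ratio $\overline{\varphi}/\gamma_1$ is held fixed, so the bound stays of order one.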

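Iteratively, for all $t > 0$ and $k \ge 1$, there is a constant $c_k > 0$ independent
of $\gamma_1$ (where $c_k$ depends only on the moments of $h$) such that

$$\frac{c_k \underline{\varphi}}{\gamma_1} + O\!\left(\frac{1}{\gamma_1}\right) \le \mu_k(t) \le \frac{c_k \overline{\varphi}}{\gamma_1} + O\!\left(\frac{1}{\gamma_1}\right),$$

with $\underline{\varphi}$ and $\overline{\varphi}$ the lower and upper bounds of $\varphi$ used in (47).

Assume (S2). The case $k = 1$ follows directly from the above calculations,
and for all $k \ge 1$ and $t > 0$,

$$\frac{\underline{\varphi}\,\mathbb{E}^k h}{k \gamma_1} + O\!\left(\gamma_1^{k-2}\right) \le \mu_k(t) \le \frac{\overline{\varphi}\,\mathbb{E}^k h}{k \gamma_1} + O\!\left(\gamma_1^{k-2}\right).$$

Finally, assume (S3). The same method shows that for all $t > 0$ and $k \ge 1$,
there is a constant $c_k$ independent of $\gamma_1$ ($c_k$ depends on the moments of $h$ and
on $\varphi$) such that

$$\frac{c_k}{\gamma_1} + O\!\left(\frac{1}{\gamma_1^2}\right) \le \mu_k(t) \le \frac{c_k}{\gamma_1} + O\!\left(\frac{1}{\gamma_1^2}\right).$$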

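A similar calculation with $g(x, y) = y x^k$ gives analogous scaling. Namely,
we have

$$A_2 (x^k y) = (-\gamma_1 k - \gamma_2)\, x^k y + \lambda_2 x^{k+1} + \varphi(y)\, y \sum_{i=0}^{k-1} \binom{k}{i} x^i\, \mathbb{E}^{k-i} h,$$

so that, for $k \ge 1$,

$$\begin{aligned}
(-\gamma_1 k - \gamma_2)\, \nu_k(t) + \lambda_2 \mu_{k+1}(t) + \underline{\varphi} \sum_{i=0}^{k-1} \binom{k}{i} \nu_i(t)\, \mathbb{E}^{k-i} h
&\le \dot{\nu}_k(t) \\
&\le (-\gamma_1 k - \gamma_2)\, \nu_k(t) + \lambda_2 \mu_{k+1}(t) + \overline{\varphi} \sum_{i=0}^{k-1} \binom{k}{i} \nu_i(t)\, \mathbb{E}^{k-i} h,
\end{aligned}$$

where $\underline{\varphi}$ and $\overline{\varphi}$ are the lower and upper bounds of $\varphi$ from assumption (5),
while for $k = 0$ we obtain

$$\dot{\nu}_0 = -\gamma_2 \nu_0 + \lambda_2 \mu_1.$$

Then $\nu_0$ is uniformly bounded for each of the scalings (S1), (S2), and (S3). Iteratively
using the inequalities for $\nu_k$, the scaling of $\mu_{k+1}$, and Gronwall's
inequality then yields the desired result for each scaling. □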
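The scalings in Proposition 6 can be illustrated numerically in a special case. The sketch below is not part of the proof: it assumes a constant burst rate $\varphi$ and exponentially distributed burst sizes with mean $b$ (so that $\mathbb{E}^j h = j!\,b^j$), sets $\dot\mu_k = 0$ in the moment equation, and compares the resulting stationary moments at two values of $\gamma_1$. The function names and the base parameters are hypothetical.

```python
from math import comb, factorial

def stationary_moments(gamma1, phi, b, kmax):
    """Stationary moments mu_0..mu_kmax of X for a constant burst rate phi
    and exponential burst sizes with mean b (assumption: E^j h = j! * b**j).

    Setting d(mu_k)/dt = 0 in the moment equation gives
    mu_k = phi / (k * gamma1) * sum_{i<k} C(k, i) * mu_i * E^{k-i} h.
    """
    mu = [1.0]  # mu_0 = 1 (total probability)
    for k in range(1, kmax + 1):
        s = sum(comb(k, i) * mu[i] * factorial(k - i) * b ** (k - i)
                for i in range(k))
        mu.append(phi * s / (k * gamma1))
    return mu

def scaled(gamma1, scaling):
    """Moments under each scaling, with hypothetical base values phi = b = 1."""
    if scaling == "S1":   # burst rate grows like gamma1
        return stationary_moments(gamma1, phi=gamma1, b=1.0, kmax=3)
    if scaling == "S2":   # burst size grows like gamma1
        return stationary_moments(gamma1, phi=1.0, b=gamma1, kmax=3)
    if scaling == "S3":   # h and phi fixed (only lambda_2 is rescaled)
        return stationary_moments(gamma1, phi=1.0, b=1.0, kmax=3)
    raise ValueError(scaling)

def ratios(scaling, g_lo=1e3, g_hi=1e4):
    """Ratio mu_k(g_hi) / mu_k(g_lo) for k = 1, 2, 3."""
    lo, hi = scaled(g_lo, scaling), scaled(g_hi, scaling)
    return [hi[k] / lo[k] for k in (1, 2, 3)]

if __name__ == "__main__":
    for name in ("S1", "S2", "S3"):
        print(name, ratios(name))
```

Increasing $\gamma_1$ tenfold leaves the moments unchanged under (S1), multiplies $\mu_k$ by roughly $10^{k-1}$ under (S2), and divides it by roughly $10$ under (S3), in line with (44) and (45).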

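3.2. Density evolution equations. Let $u(t,x,y)$ be the density function
of $(X,Y)$ at time $t$ obtained from the solutions of equations (1)-(2). The
evolution of the density $u(t, x, y)$ is governed by

$$\frac{\partial u}{\partial t} = \frac{\partial}{\partial x}\left[\gamma_1 x\, u(t,x,y)\right] - \frac{\partial}{\partial y}\left[(\lambda_2 x - \gamma_2 y)\, u(t,x,y)\right] + \int_0^x \varphi(y)\, u(t,z,y)\, h(x - z)\,dz - \varphi(y)\, u(t,x,y) \tag{48}$$

when $(t,x,y) \in (0, +\infty)^3$.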

In this subsection, we prove that when $\gamma_1 \to \infty$ the density function
$u(t,x,y)$ approaches the density $v(t,y)$ for solutions of either the deterministic
equation (10) or the stochastic differential equation (12), depending on
the scaling. Evolution of the density function for equation (10) is given by
(Lasota and Mackey, 1985)

$$\frac{\partial u_0}{\partial t} = -\frac{\partial}{\partial y}\left[-\gamma_2 y\, u_0 + \lambda_2 \psi(y)\, u_0\right]. \tag{49}$$

Here we note that

$$\psi(y) = b\varphi(y)/\gamma_1. \tag{50}$$

Evolution of the density for equation (12) is given by

$$\frac{\partial v(t,y)}{\partial t} = \frac{\partial}{\partial y}\left[\gamma_2 y\, v(t,y)\right] + \int_0^y \varphi(z)\, v(t,z)\, \bar{h}(y - z)\,dz - \varphi(y)\, v(t,y) \tag{51}$$

when $(t,y) \in (0, +\infty)^2$. Here $\bar{h}$ is given by

$$\bar{h}(y) = \frac{\gamma_1}{\lambda_2}\, h\!\left(\frac{\gamma_1}{\lambda_2}\, y\right). \tag{52}$$

We note that under the scaling (S2), $\bar{h}(y) = \lambda_2^{-1} h(\lambda_2^{-1} y)$ is independent of
$\gamma_1$, and under the scaling (S3), $\gamma_1/\lambda_2$ is independent of $\gamma_1$, and therefore $\bar{h}$
is also independent of $\gamma_1$.

In the following proof, we will need the existence of derivatives of all orders of
the density functions. Thus, in equations (48)-(51) we make the additional
hypotheses:

(H1) The density function $h$ satisfies (6), and $h \in C^\infty$;

(H2) The function $\bar{h}$ is given by Theorem 2 or Theorem 3 depending on the
scaling;

(H3) The rate function $\varphi(y)$ satisfies (5) and $\varphi \in C^\infty$.

When conditions (5)-(6) are satisfied, existence of the above densities
has been rigorously proved in Mackey and Tyran-Kamińska (2008); Tyran-Kamińska
(2009). In particular, for a given initial density

$$u(0,x,y) = p(x,y), \qquad 0 < x, y < +\infty \tag{53}$$

that satisfies

$$p(x,y) \ge 0, \qquad \int_0^\infty\!\!\int_0^\infty p(x,y)\,dx\,dy = 1, \tag{54}$$

there is a unique solution $u(t, x, y)$ of (48) that satisfies the initial condition
(53) and

$$u(t,x,y) \ge 0, \qquad \int_0^\infty\!\!\int_0^\infty u(t,x,y)\,dx\,dy = 1. \tag{55}$$

Moreover, if the moments of the initial density satisfy

$$u_n(y) = \int_0^\infty x^n p(x,y)\,dx < +\infty, \qquad \forall y > 0,\ n = 0, 1, \ldots, \tag{56}$$

then the marginal moments

$$u_n(t,y) = \int_0^\infty x^n u(t,x,y)\,dx \tag{57}$$

are well defined for $t > 0$ and $y > 0$ from the discussion in Section 3.1.
Therefore

$$\lim_{x \to \infty} x^n u(t, x, y) = 0, \qquad \forall t, y. \tag{58}$$

Here, we will show, using semigroup techniques as in Mackey and Tyran-Kamińska
(2008); Tyran-Kamińska (2009), that under the hypotheses (H1)-(H3),
the densities are smooth. We will use the following result.

Proposition 7. (Pazy, 1992, Corollary 5.6, page 124) Let $Y$ be a subspace
of a Banach space $X$, with $(Y, \|\cdot\|_Y)$ a Banach space as well. Let
$T(t)$ be a strongly continuous semigroup on $X$, with infinitesimal generator
$C$. Then $Y$ is an invariant subspace of $T(t)$ if

• For sufficiently large $\lambda$, $Y$ is an invariant subspace of $R(\lambda, C)$;

• There exist constants $c_1$ and $c_2$ such that, for $\lambda > c_2$,

$$\|R(\lambda, C)^n\|_Y \le c_1 (\lambda - c_2)^{-n}, \qquad n = 1, 2, \ldots;$$

• For $\lambda > c_2$, $R(\lambda, C)Y$ is dense in $Y$.

Then, we have

Lemma 8. Assume (H1)-(H3). If the initial condition $v(0, y) \in C^\infty \cap L^1$,
then the unique solution of equation (51) (respectively (49)) satisfies $v(t, y) \in C^\infty \cap L^1$.
Similarly, if the initial condition $u(0,x,y) \in (C^\infty)^2 \cap L^1$, then the unique
solution of equation (48) satisfies $u(t,x,y) \in (C^\infty)^2 \cap L^1$.

Proof. Because the dynamical system given by (10) is smooth and invertible,
the result for equation (49) is standard (Lasota and Mackey, 1985,
Remark 7.6.2, page 187). We will show the result for equation (51); the
result for equation (48) follows in a similar fashion. We need to show
that the subspace $C_0^\infty \subset L^1$ is invariant under the action of the semigroup
defined by equation (51). According to Mackey and Tyran-Kamińska (2008)
(and references therein), we know that the semigroup defined by equation
(51) is a strongly continuous semigroup whose infinitesimal generator $C$ is
characterized by the resolvent

$$R(\lambda, C)v = \lim_{N \to \infty} R(\lambda, A) \sum_{n=0}^{N} \left(P(\varphi R(\lambda, A))\right)^n v \tag{59}$$

for all $v \in L^1$, $\lambda > 0$, where the limit holds in $L^1$ and $A$ and $P$ are the
operators given by

$$Av(y) = \frac{d(\gamma_2 y\, v(y))}{dy} - \varphi(y)\, v(y), \tag{60}$$

$$Pv(y) = \int_0^y v(z)\, \bar{h}(y - z)\,dz, \tag{61}$$

and the resolvent $R(\lambda, A)$ is given by, for all $v \in L^1$,

$$R(\lambda, A)v(y) = \frac{1}{\gamma_2 y} \int_y^\infty e^{Q_\lambda(z) - Q_\lambda(y)}\, v(z)\,dz \tag{62}$$

with $Q_\lambda(y) = -\dfrac{\lambda \ln y}{\gamma_2} - \displaystyle\int^{y} \frac{\varphi(z)}{\gamma_2 z}\,dz$. We also know that for
$v \in \mathcal{D}(A) = \{v \in L^1 : yv$ is absolutely continuous and $\frac{d(y v(y))}{dy} \in L^1\}$,
we have

$$Cv = Av + P(\varphi v). \tag{63}$$

We will now use Proposition 7 above to complete the
proof. Note that according to (H1)-(H3), $Q_\lambda$ is a $C^\infty$ decreasing function,
so that for $v \in C_0^\infty$, $R(\lambda, A)v \in C_0^\infty$. Moreover, a simple computation yields,
for all $\lambda > \gamma_2$,

$$|R(\lambda, A)v(y)| \le \sup_{(y, \infty)} |v(z)|\, \frac{1}{\lambda - \gamma_2} \le \frac{\|v\|_\infty}{\lambda - \gamma_2}.$$

Then

$$\|P(\varphi R(\lambda, A))v\|_\infty \le \|v\|_\infty\, \frac{\overline{\varphi}}{\lambda - \gamma_2}$$

and

$$\|\left(P(\varphi R(\lambda, A))\right)^n v\|_\infty \le \|v\|_\infty \left(\frac{\overline{\varphi}}{\lambda - \gamma_2}\right)^n,$$

where $\overline{\varphi}$ is the upper bound of $\varphi$, so that convergence in equation (59) holds in $C^\infty$ and $C_0^\infty$ is invariant for
$R(\lambda, C)$. The second condition in Proposition 7 then follows from the previous
calculations. Finally, because $R(\lambda, C) = (\lambda - C)^{-1}$, to show that $R(\lambda, C)C_0^\infty$
is dense in $C_0^\infty$, it is enough to show that

$$(\lambda - C)C_0^\infty \subset C_0^\infty.$$

According to equation (63) and hypotheses (H1)-(H3), this is true. □

The main result given below shows that when $\gamma_1$ is large enough, $u_0(t,y)$
as defined above gives an approximate solution of (49) or (51).

Theorem 9. Assume (H1)-(H3) hold. Let $u(0,x,y) \in (C^\infty)^2 \cap L^1$. For
any $\gamma_1 > 0$, let $u(t, x, y; \gamma_1)$ be the associated solution of (48), and define

$$u_0(t, y; \gamma_1) = \int_0^\infty u(t,x,y; \gamma_1)\,dx. \tag{64}$$

(1) Under the scaling (S1), when $\gamma_1 \to \infty$, $u_0(t, y; \gamma_1)$ approaches the
solution of (49).

(2) Under the scaling (S2) or (S3), when $\gamma_1 \to \infty$, $u_0(t, y; \gamma_1)$ approaches
the solution of (51) with $\bar{h}$ defined by (52).

In all cases, convergence holds in , uniformly in time on any bounded 
time interval. 

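Proof. Throughout the proof, we omit $\gamma_1$ in the solution $u(t,x,y; \gamma_1)$
and in the marginal density $u_0(t, y; \gamma_1)$, and keep in mind that they depend
on the parameter $\gamma_1$ through equation (48).

First, let

$$u_n(t,y) = \int_0^\infty x^n u(t,x,y)\,dx, \qquad n = 0, 1, \ldots, \tag{65}$$

which are well defined from the previous discussion.
From (48) and (58), we have

$$\frac{\partial u_n}{\partial t} = -n\gamma_1 u_n - \lambda_2 \frac{\partial u_{n+1}}{\partial y} + \gamma_2 \frac{\partial (y u_n)}{\partial y} + \int_0^\infty\!\!\int_0^x \varphi(y)\, x^n u(t, z, y)\, h(x - z)\,dz\,dx - \varphi(y)\, u_n. \tag{66}$$

Since

$$\int_0^\infty\!\!\int_0^x \varphi(y)\, x^n u(t, z, y)\, h(x - z)\,dz\,dx = \sum_{j=0}^{n} \binom{n}{j} \varphi(y)\, u_{n-j}\, \mathbb{E}^j h,$$

where

$$\mathbb{E}^j h = \int_0^\infty x^j h(x)\,dx,$$

we have

$$\frac{\partial u_n}{\partial t} = -n\gamma_1 u_n - \lambda_2 \frac{\partial u_{n+1}}{\partial y} + \gamma_2 \frac{\partial (y u_n)}{\partial y} + \varphi(y) \sum_{j=1}^{n} \binom{n}{j} u_{n-j}\, \mathbb{E}^j h. \tag{67}$$

In particular, when $n = 0$,

$$\frac{\partial u_0}{\partial t} = -\lambda_2 \frac{\partial u_1}{\partial y} + \gamma_2 \frac{\partial (y u_0)}{\partial y}. \tag{68}$$

When $n \ge 1$, we have

$$\frac{1}{\gamma_1} \frac{\partial u_n}{\partial t} = -n u_n - \frac{\lambda_2}{\gamma_1} \frac{\partial u_{n+1}}{\partial y} + \frac{\gamma_2}{\gamma_1} \frac{\partial (y u_n)}{\partial y} + \frac{1}{\gamma_1} \varphi(y) \sum_{j=1}^{n} \binom{n}{j} u_{n-j}\, \mathbb{E}^j h. \tag{69}$$

When $n \ge 1$ and $\gamma_1 \to \infty$, we apply the quasi-equilibrium assumption to
(69) by assuming

$$\frac{1}{\gamma_1} \frac{\partial u_n}{\partial t} \approx 0$$

when $t > t_0 > 0$, and hence

$$u_n \approx -\frac{\lambda_2}{n\gamma_1} \frac{\partial u_{n+1}}{\partial y} + \frac{\gamma_2}{n\gamma_1} \frac{\partial (y u_n)}{\partial y} + \frac{1}{n\gamma_1} \varphi(y) \sum_{j=1}^{n} \binom{n}{j} u_{n-j}\, \mathbb{E}^j h, \qquad (\forall n \ge 1). \tag{70}$$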



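Now, we are ready to prove the results for the three different scalings.

(S1). For the scaling (S1), $\varphi(y) \sim \gamma_1$, and we have

$$u_n = \frac{1}{n} \frac{\varphi(y)}{\gamma_1} \sum_{j=1}^{n} \binom{n}{j} u_{n-j}\, \mathbb{E}^j h + O\!\left(\frac{1}{\gamma_1}\right),$$

so

$$u_1 = \frac{b\varphi(y)}{\gamma_1}\, u_0 + O\!\left(\frac{1}{\gamma_1}\right), \tag{71}$$

where $b = \mathbb{E}h$. Substituting (71) into (68), we obtain

$$\frac{\partial u_0}{\partial t} = \frac{\partial}{\partial y}\left[\gamma_2 y\, u_0 - \lambda_2 \psi(y)\, u_0\right] + O\!\left(\frac{1}{\gamma_1}\right) \tag{72}$$

with $\psi(y) = b\varphi(y)/\gamma_1$. Finally, note that

$$u_0(T, y) = u_0(0, y) + \int_0^T \frac{\partial u_0(t, y)}{\partial t}\,dt,$$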



so (1) follows and convergence holds in C^, uniformly in time on any bounded 
time interval. 
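As a consistency check on the (S1) limit at the level of first moments, the following sketch compares $\mathbb{E}[Y(t)]$ in the full model with the solution of the reduced equation (10). It assumes a constant burst rate $\varphi = \gamma_1 \psi$, for which the moment equations $\dot\mu_1 = -\gamma_1\mu_1 + \varphi b$ and $\dot\nu_0 = -\gamma_2\nu_0 + \lambda_2\mu_1$ are linear and can be solved in closed form. The function names and parameter values are hypothetical.

```python
from math import exp

def mean_y_full(t, gamma1, gamma2, lam2, b, psi, x0, y0):
    """E[Y(t)] in the full model for the constant burst rate phi = gamma1 * psi.

    Closed-form solution of the linear moment equations
    d(mu1)/dt = -gamma1 * mu1 + phi * b,  d(nu0)/dt = -gamma2 * nu0 + lam2 * mu1,
    with mu1(0) = x0 and nu0(0) = y0 (requires gamma1 != gamma2).
    """
    transient = (x0 - psi * b) * (exp(-gamma2 * t) - exp(-gamma1 * t)) / (gamma1 - gamma2)
    slow = y0 * exp(-gamma2 * t) + lam2 * psi * b * (1.0 - exp(-gamma2 * t)) / gamma2
    return slow + lam2 * transient

def y_reduced(t, gamma2, lam2, b, psi, y0):
    """Solution of the reduced ODE dY/dt = -gamma2 * Y + lam2 * b * psi."""
    return y0 * exp(-gamma2 * t) + lam2 * b * psi * (1.0 - exp(-gamma2 * t)) / gamma2

def gap(gamma1, t=1.0):
    """Distance between the full and reduced means for hypothetical parameters."""
    full = mean_y_full(t, gamma1, gamma2=1.0, lam2=1.0, b=2.0, psi=1.0, x0=0.0, y0=0.5)
    return abs(full - y_reduced(t, gamma2=1.0, lam2=1.0, b=2.0, psi=1.0, y0=0.5))

if __name__ == "__main__":
    for g in (10.0, 100.0, 1000.0):
        print(g, gap(g))
```

In this special case the gap equals $\lambda_2 |x_0 - \psi b|\,(e^{-\gamma_2 t} - e^{-\gamma_1 t})/(\gamma_1 - \gamma_2)$, which is of order $1/\gamma_1$, matching the remainder in (72).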

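(S2). We assume the scaling (S2), so that $h(\Delta X) \to \frac{1}{\gamma_1} h\!\left(\frac{\Delta X}{\gamma_1}\right)$ and
the re-scaled $j$th moment

$$b_j = \gamma_1^{-j}\, \mathbb{E}^j h$$

is independent of $\gamma_1$. Hence, from (70) and Proposition 6, we have

$$\begin{aligned}
\gamma_1^{-(n-1)} u_n &= -\frac{\lambda_2}{n} \frac{\partial (\gamma_1^{-n} u_{n+1})}{\partial y} + \frac{\gamma_2}{n\gamma_1} \frac{\partial (y \gamma_1^{-(n-1)} u_n)}{\partial y} + \frac{1}{n} \varphi(y)\, u_0 b_n \\
&\quad + \frac{1}{n\gamma_1} \varphi(y) \sum_{j=1}^{n-1} \binom{n}{j} \gamma_1^{-(n-j-1)} u_{n-j}\, b_j \\
&= \frac{1}{n} b_n \varphi(y)\, u_0 - \frac{\lambda_2}{n} \frac{\partial (\gamma_1^{-n} u_{n+1})}{\partial y} + O\!\left(\frac{1}{\gamma_1}\right).
\end{aligned}$$

Therefore,

$$\begin{aligned}
u_1 &= b_1 \varphi(y)\, u_0 - \lambda_2 \frac{\partial}{\partial y}\left[\gamma_1^{-1} u_2\right] + O\!\left(\frac{1}{\gamma_1}\right) \\
&= b_1 \varphi(y)\, u_0 - \lambda_2 \frac{\partial}{\partial y}\left[\frac{1}{2} b_2 \varphi(y)\, u_0 - \frac{\lambda_2}{2} \frac{\partial (\gamma_1^{-2} u_3)}{\partial y}\right] + O\!\left(\frac{1}{\gamma_1}\right) \\
&= b_1 \varphi(y)\, u_0 - b_2 \frac{\lambda_2}{2} \frac{\partial}{\partial y}\left(\varphi(y)\, u_0\right) + \cdots \\
&= \sum_{k=0}^{\infty} \frac{(-\lambda_2)^k}{(k+1)!}\, b_{k+1} \frac{\partial^k (\varphi(y)\, u_0)}{\partial y^k} + O\!\left(\frac{1}{\gamma_1}\right).
\end{aligned}$$

Thus, when $\gamma_1 \to \infty$, we have

$$\begin{aligned}
-\lambda_2 \frac{\partial u_1}{\partial y} &= \sum_{k=1}^{\infty} \frac{(-\lambda_2)^k}{k!}\, b_k \frac{\partial^k (\varphi(y)\, u_0)}{\partial y^k} \\
&= \sum_{k=1}^{\infty} \frac{(-1)^k}{k!} \left(\int_0^\infty x^k \bar{h}(x)\,dx\right) \frac{\partial^k (\varphi(y)\, u_0)}{\partial y^k} \\
&= \int_0^\infty \bar{h}(x) \sum_{k=1}^{\infty} \frac{(-x)^k}{k!} \frac{\partial^k (\varphi(y)\, u_0)}{\partial y^k}\,dx \\
&= \int_0^\infty \bar{h}(x) \left(\varphi(y - x)\, u_0(t, y - x) - \varphi(y)\, u_0(t, y)\right) dx \\
&= \int_0^\infty \bar{h}(x)\, \varphi(y - x)\, u_0(t, y - x)\,dx - \varphi(y)\, u_0(t, y) \\
&= \int_{-\infty}^{y} \bar{h}(y - z)\, \varphi(z)\, u_0(t, z)\,dz - \varphi(y)\, u_0(t, y) \\
&= \int_0^{y} \bar{h}(y - z)\, \varphi(z)\, u_0(t, z)\,dz - \varphi(y)\, u_0(t, y).
\end{aligned}$$

Here we note $\varphi(z) = 0$ when $z < 0$.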
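The key step above resums a Taylor series in the moments of $\bar h$ into a convolution. The identity $\sum_{k \ge 1} \frac{(-1)^k}{k!} \mathbb{E}^k[\bar h]\, g^{(k)}(y) = \int_0^\infty \bar h(x)\,(g(y-x) - g(y))\,dx$ can be checked numerically in a simple case. The sketch below assumes an exponential $\bar h$ with mean $m$ and the hypothetical test function $g(y) = e^{-ay}$ on the whole line (so the truncation at $z < 0$ is ignored), with $am < 1$ so that the series converges.

```python
from math import exp, factorial

def series_side(y, a, m, terms=60):
    """Sum_{k>=1} (-1)^k / k! * E^k[h] * g^(k)(y) for g(y) = exp(-a*y)
    and h exponential with mean m, whose k-th moment is k! * m**k."""
    g = exp(-a * y)
    total = 0.0
    for k in range(1, terms + 1):
        moment = factorial(k) * m ** k      # E^k[h]
        deriv = (-a) ** k * g               # k-th derivative of g at y
        total += (-1) ** k / factorial(k) * moment * deriv
    return total

def integral_side(y, a, m, upper=60.0, n=6000):
    """Composite Simpson rule for int_0^upper h(x) * (g(y - x) - g(y)) dx.

    n must be even; the tail beyond `upper` is negligible when a*m < 1.
    """
    def f(x):
        return exp(-x / m) / m * (exp(-a * (y - x)) - exp(-a * y))
    step = upper / n
    s = f(0.0) + f(upper)
    for i in range(1, n):
        s += (4 if i % 2 else 2) * f(i * step)
    return s * step / 3.0

if __name__ == "__main__":
    y, a, m = 2.0, 0.3, 1.0
    print(series_side(y, a, m), integral_side(y, a, m))
```

For these choices both sides reduce to $g(y)\,\frac{am}{1 - am}$, which gives an independent closed-form value to compare against.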

Therefore, from (68), when $\gamma_1 \to \infty$, $u_0$ approaches the solution of (51),
and the desired result follows.

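(S3). Now, we consider the case of scaling (S3), so $\lambda_2/\gamma_1$ is independent of
$\gamma_1$. From (70) and Proposition 6, we have

$$\begin{aligned}
u_n &= -\frac{1}{n} \frac{\lambda_2}{\gamma_1} \frac{\partial u_{n+1}}{\partial y} + \frac{\gamma_2}{n\gamma_1} \frac{\partial (y u_n)}{\partial y} + \frac{1}{n\gamma_1} \varphi(y)\, u_0\, \mathbb{E}^n h + \frac{1}{n\gamma_1} \varphi(y) \sum_{j=1}^{n-1} \binom{n}{j} u_{n-j}\, \mathbb{E}^j h \\
&= \frac{1}{n\gamma_1} \varphi(y)\, u_0\, \mathbb{E}^n h - \frac{\lambda_2}{n\gamma_1} \frac{\partial u_{n+1}}{\partial y} + O\!\left(\frac{1}{\gamma_1^2}\right).
\end{aligned}$$

Therefore,

$$\begin{aligned}
u_1 &= \frac{1}{\gamma_1} \varphi(y)\, u_0\, \mathbb{E}h - \frac{\lambda_2}{\gamma_1} \frac{\partial u_2}{\partial y} + O\!\left(\frac{1}{\gamma_1^2}\right) \\
&= \frac{1}{\gamma_1} \varphi(y)\, u_0\, \mathbb{E}h - \frac{\lambda_2}{\gamma_1} \frac{\partial}{\partial y}\left[\frac{1}{2\gamma_1} \varphi(y)\, u_0\, \mathbb{E}^2 h - \frac{\lambda_2}{2\gamma_1} \frac{\partial u_3}{\partial y}\right] + O\!\left(\frac{1}{\gamma_1^2}\right) \\
&= \cdots \\
&= \frac{1}{\lambda_2} \sum_{k=1}^{\infty} \frac{(-1)^{k-1}}{k!} \left(\frac{\lambda_2}{\gamma_1}\right)^k \mathbb{E}^k h\, \frac{\partial^{k-1} (\varphi(y)\, u_0)}{\partial y^{k-1}} + O\!\left(\frac{1}{\gamma_1^2}\right).
\end{aligned}$$

Thus, when $\gamma_1 \to \infty$, in a manner similar to the above argument, we have

$$-\lambda_2 \frac{\partial u_1}{\partial y} = \int_0^y \bar{h}(y - z)\, \varphi(z)\, u_0(t, z)\,dz - \varphi(y)\, u_0(t, y),$$

and the result follows. □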

3.3. Result on generators. In this approach, we use a result on semi- 
groups and the convergence of generators to prove convergence in law for 
the stochastic process. The main result is 

Theorem 10. Let Y{t) denote the stochastic process determined by the 
solution of (l)-(2). 

1. Assume the scaling (S1), i.e., $\varphi(y)/\gamma_1 = \psi(y)$ as $\gamma_1 \to \infty$ and $h$, $\lambda_2$
remain unchanged. Then $Y(t)$ converges in law towards the deterministic
solution of the ordinary differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + \lambda_2 b\, \psi(Y), \qquad Y(0) = Y_0 > 0. \tag{73}$$

2. Assume the scaling (S2), i.e., $h \to \frac{1}{\gamma_1} h\!\left(\frac{\cdot}{\gamma_1}\right)$ as $\gamma_1 \to \infty$ and $\varphi$, $\lambda_2$
remain unchanged. Then $Y(t)$ converges in law towards the solution
of the stochastic differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + N(\bar{h}, \varphi(Y)), \qquad Y(0) = Y_0 > 0 \tag{74}$$

with $\bar{h}(\Delta Y) = \lambda_2^{-1} h(\lambda_2^{-1} \Delta Y)$ the density function of the jump size
$\Delta Y$.

3. Assume the scaling (S3), i.e., $\lambda_2/\gamma_1 \to \tilde{\lambda}_2$ as $\gamma_1 \to \infty$ and $h$, $\varphi$ remain
unchanged. Then $Y(t)$ converges in law towards the solution of the
stochastic differential equation

$$\frac{dY}{dt} = -\gamma_2 Y + N(\bar{h}, \varphi(Y)), \qquad Y(0) = Y_0 > 0 \tag{75}$$

with $\bar{h}(\Delta Y) = \tilde{\lambda}_2^{-1} h(\tilde{\lambda}_2^{-1} \Delta Y)$ the density function of the jump size
$\Delta Y$.
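To make the limiting bursting process in (74)-(75) concrete, here is a minimal simulation sketch. It assumes a constant burst rate $\varphi$ and exponentially distributed jumps (both simplifications; the theorem allows a state-dependent $\varphi(Y)$ and a general $\bar h$). Between bursts $Y$ decays as $Y e^{-\gamma_2 t}$; for a constant rate, the state sampled just before each burst follows the stationary law (Poisson arrivals see time averages), whose mean is $\varphi\,\mathbb{E}[\bar h]/\gamma_2$. The function name and parameters are hypothetical.

```python
import random
from math import exp

def simulate_pre_jump_mean(gamma2, phi, mean_jump, n_jumps, seed=0):
    """Average of Y just before each burst for dY/dt = -gamma2*Y + N(h, phi).

    Assumptions of this sketch: constant burst rate phi and exponential
    jump sizes with mean mean_jump; Y(0) = 0.
    """
    rng = random.Random(seed)
    y = 0.0
    total = 0.0
    for _ in range(n_jumps):
        wait = rng.expovariate(phi)              # waiting time to the next burst
        y *= exp(-gamma2 * wait)                 # deterministic decay between bursts
        total += y                               # state just before the burst
        y += rng.expovariate(1.0 / mean_jump)    # burst size with mean mean_jump
    return total / n_jumps

if __name__ == "__main__":
    estimate = simulate_pre_jump_mean(gamma2=1.0, phi=2.0, mean_jump=1.0, n_jumps=200_000)
    print(estimate, "expected about", 2.0 * 1.0 / 1.0)
```

The expected value follows from taking expectations in the reduced equation: $\frac{d}{dt}\mathbb{E}[Y] = -\gamma_2 \mathbb{E}[Y] + \varphi\,\mathbb{E}[\bar h]$.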

The proof of the theorem is based on the following result.

Kurtz's Theorem. (Ethier and Kurtz, 2005, Theorem 2.5, page 167)
Let $X$ and $X^n$ be Feller processes in $(0, \infty)$ with semigroups $\{T_t\}$ and $\{T_{n,t}\}$
respectively. Then the following two conditions are equivalent:

• $T_{n,t} f \to T_t f$ for every $f \in C_0$, uniformly in time for bounded $t \ge 0$.

• If $X^n(0) \to X(0)$ in distribution, then $X^n \to X$ in distribution,

where $C_0$ is the class of continuous functions with $f(y) \to 0$ as $y \to \infty$.

Here a Feller process is a Markov process with a probability transition
function associated with a Feller semigroup as given below. Let $X_t$
be a Markov process taking values in $(0, \infty)$, and consider a semigroup defined on the space of
bounded continuous functions given by

$$T_t f(x) = \mathbb{E}_x f(X_t),$$

the expectation of the function $f$ along the stochastic process with given
starting value $x$. By definition, a Markov process is Feller if

• $T_t(C_0) \subset C_0$;

• For every $f \in C_0$, $\|T_t f\|_\infty \le \|f\|_\infty$;

• For every $f \in C_0$, $\lim_{t \to 0} \|T_t f - f\|_\infty = 0$,

i.e. if $T_t$ is a strongly continuous contraction semigroup on $C_0$.

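Lemma 11. Assume hypotheses (5)-(6). Then the stochastic processes
defined by equations (1)-(2) and by equation (12) are Feller processes.

Proof. We give a detailed proof for the case of equation (12). The
contraction property is immediate. Note that the solution of the deterministic
dynamical system, starting at $y$ at $t = 0$, is given by

$$\phi(t, y) = y e^{-\gamma_2 t}.$$

Then if $f \in C_0$ and $T_1$ is the first jump time, set

$$H(t, y) = P(T_1 > t) = \exp\left(-\int_0^t \varphi(\phi(s, y))\,ds\right),$$

so for any bounded continuous function $f$ we have the following representation
for the semigroup $T_t$ (Crudu et al., 2012, Equation 28, page 26):

$$T_t f(y) = f(\phi(t, y))\, H(t, y) + \int_0^t\!\!\int_{\phi(s,y)}^\infty T_{t-s} f(z)\, \bar{h}(z - \phi(s, y))\, \varphi(\phi(s, y))\, H(s, y)\,dz\,ds. \tag{76}$$

For any $y_1 > y_2$, we have $\phi(t, y_1) > \phi(t, y_2)$ and

$$\begin{aligned}
|T_t f(y_1) - T_t f(y_2)| &\le |f(\phi(t, y_1))\, H(t, y_1) - f(\phi(t, y_2))\, H(t, y_2)| \\
&\quad + \left| \int_0^t\!\!\int_{\phi(s,y_1)}^\infty T_{t-s} f(z) \Big[ \bar{h}(z - \phi(s, y_1))\, \varphi(\phi(s, y_1))\, H(s, y_1) \right. \\
&\qquad\qquad \left. - \bar{h}(z - \phi(s, y_2))\, \varphi(\phi(s, y_2))\, H(s, y_2) \Big]\,dz\,ds \right| \\
&\quad + \left| \int_0^t\!\!\int_{\phi(s,y_2)}^{\phi(s,y_1)} T_{t-s} f(z)\, \bar{h}(z - \phi(s, y_2))\, \varphi(\phi(s, y_2))\, H(s, y_2)\,dz\,ds \right|,
\end{aligned}$$

so that

$$\begin{aligned}
|T_t f(y_1) - T_t f(y_2)| &\le |f(\phi(t, y_1))\, H(t, y_1) - f(\phi(t, y_2))\, H(t, y_2)| \\
&\quad + \|f\|_\infty \int_0^t\!\!\int_{\phi(s,y_1)}^\infty \big| \bar{h}(z - \phi(s, y_1))\, \varphi(\phi(s, y_1))\, H(s, y_1) \\
&\qquad\qquad - \bar{h}(z - \phi(s, y_2))\, \varphi(\phi(s, y_2))\, H(s, y_2) \big|\,dz\,ds \\
&\quad + \|f\|_\infty \int_0^t\!\!\int_{\phi(s,y_2)}^{\phi(s,y_1)} \big| \bar{h}(z - \phi(s, y_2))\, \varphi(\phi(s, y_2))\, H(s, y_2) \big|\,dz\,ds,
\end{aligned}$$

which goes to $0$ as $|y_2 - y_1| \to 0$ by dominated convergence and assumptions
(5)-(6). Thus $y \mapsto T_t f(y)$ is continuous. Because the jumps in $Y$ are positive,
for all $t > 0$ and $y > 0$,

$$Y_t \ge \phi(t, y) \quad \text{whenever } Y_0 = y.$$

Then, if $f(y) \to 0$ as $y \to \infty$, we also have (for fixed $t$) $f(Y_t) \to 0$, so that
$T_t(C_0) \subset C_0$.

Finally, take $f \in C_c$, a continuous function with compact support. From
equation (76), for any $y > 0$,

$$\begin{aligned}
|T_t f(y) - f(y)| &\le |f(\phi(t, y)) - f(y)|\, H(t, y) + |f(y)|\, (1 - H(t, y)) \\
&\quad + \int_0^t\!\!\int_{\phi(s,y)}^\infty \big| T_{t-s} f(z)\, \bar{h}(z - \phi(s, y))\, \varphi(\phi(s, y))\, H(s, y) \big|\,dz\,ds \\
&\le |f(\phi(t, y)) - f(y)| + \|f\|_\infty \left(1 - \exp(-\overline{\varphi}\, t)\right) + \|f\|_\infty\, \overline{\varphi} \int_0^t \exp(-\underline{\varphi}\, s)\,ds,
\end{aligned}$$

where $\underline{\varphi}$ and $\overline{\varphi}$ are the lower and upper bounds of $\varphi$ from assumption (5).
Now because $f \in C_c$, $|f(\phi(t, y)) - f(y)| \to 0$ as $t \to 0$, uniformly in $y$, and

$$\|T_t f - f\|_\infty \to 0.$$

The lemma then follows from the density of $C_c$ in $C_0$ and the contraction
property of $T_t$. □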

In Kurtz's Theorem, by the continuity of the semigroup, to prove the
convergence of the stochastic processes it is enough to show the convergence of
the semigroups on a dense subset of $C_0$. We therefore restrict our discussion
to $C_0^\infty$, the class of all infinitely differentiable functions such that $f$ and all
its derivatives belong to $C_0$.

To apply this theorem, we show the convergence of the semigroups through
their time-derivative formula, which is provided by Dynkin's formula (see
Section 3.1).

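Proof of Theorem 10. In (43), choose a special test function $g(x, y) =
x^n f(y)$ with $f \in C_0^\infty$. Then we have

$$\begin{aligned}
\frac{d}{dt} \mathbb{E}[X_t^n f(Y_t)] &= -\gamma_1 n\, \mathbb{E}[X_t^n f(Y_t)] - \gamma_2\, \mathbb{E}[X_t^n Y_t f'(Y_t)] + \lambda_2\, \mathbb{E}[X_t^{n+1} f'(Y_t)] \\
&\quad + \mathbb{E}\left[\varphi(Y_t)\, f(Y_t) \left(\int_{X_t}^\infty h(z - X_t)\, z^n\,dz - X_t^n\right)\right].
\end{aligned} \tag{77}$$

Since

$$\begin{aligned}
\int_x^\infty h(z - x)\, z^n\,dz - x^n &= \int_x^\infty h(z - x)(z - x + x)^n\,dz - x^n \\
&= \int_x^\infty h(z - x) \sum_{k=0}^{n} \binom{n}{k} (z - x)^k x^{n-k}\,dz - x^n \\
&= \sum_{k=1}^{n} \binom{n}{k} x^{n-k}\, \mathbb{E}^k[h],
\end{aligned}$$

where $\mathbb{E}^k[h]$ is the $k$th moment of $h$, equation (77) becomes

$$\begin{aligned}
\frac{d}{dt} \mathbb{E}[X_t^n f(Y_t)] &= -\gamma_1 n\, \mathbb{E}[X_t^n f(Y_t)] - \gamma_2\, \mathbb{E}[X_t^n Y_t f'(Y_t)] + \lambda_2\, \mathbb{E}[X_t^{n+1} f'(Y_t)] \\
&\quad + \sum_{k=1}^{n} \binom{n}{k} \mathbb{E}^k[h]\, \mathbb{E}\left[\varphi(Y_t)\, f(Y_t)\, X_t^{n-k}\right].
\end{aligned} \tag{78}$$

Because $f$ is bounded, each term in equation (78) is dominated by
either $C_f \mu_k(t)$ or $C_f \nu_k(t)$, $k = 0, \ldots, n$, where $C_f$ is a constant that depends
on $f$. Now, as in Section 3.1, when $n \ge 1$, the quantity $\mathbb{E}[X_t^n f(Y_t)]$ rapidly
approaches its equilibrium value as $\gamma_1 \to \infty$, so that we can assume

$$\frac{1}{\gamma_1} \frac{d}{dt} \mathbb{E}[X_t^n f(Y_t)] \approx 0,$$

which is true for $t$ bounded away from $0$. Then, for all $n \ge 1$ and $t > 0$, as $\gamma_1 \to \infty$,

$$\begin{aligned}
\mathbb{E}[X_t^n f(Y_t)] &= -\frac{\gamma_2}{\gamma_1 n}\, \mathbb{E}[X_t^n Y_t f'(Y_t)] + \frac{\lambda_2}{\gamma_1 n}\, \mathbb{E}[X_t^{n+1} f'(Y_t)] \\
&\quad + \frac{1}{\gamma_1 n} \sum_{k=1}^{n} \binom{n}{k} \mathbb{E}^k[h]\, \mathbb{E}\left[\varphi(Y_t)\, f(Y_t)\, X_t^{n-k}\right].
\end{aligned} \tag{79}$$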



Now, we are ready to prove the results for the three different scaling 
regimes. 

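(S1). Letting $n = 1$ in (79), from Proposition 6 and remembering the scaling
(S1), for all $t > 0$ and $f \in C_0^\infty$ and as $\gamma_1 \to \infty$, we obtain

$$\lambda_2\, \mathbb{E}[X_t f'(Y_t)] = \lambda_2\, \mathbb{E}[h]\, \mathbb{E}[\psi(Y_t)\, f'(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right).$$

Now, let $n = 0$ in (78) to obtain

$$\frac{d}{dt} \mathbb{E}[f(Y_t)] = -\gamma_2\, \mathbb{E}[Y_t f'(Y_t)] + \lambda_2\, \mathbb{E}[h]\, \mathbb{E}[\psi(Y_t)\, f'(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right). \tag{80}$$

Thus, it is easy to verify that when $\gamma_1 \to \infty$ and $t > \epsilon > 0$,

$$\frac{d}{dt} \mathbb{E}[f(Y_t)] = \mathbb{E}[A f(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right),$$

where $A$ is the generator of equation (73).

Now consider a sequence $\gamma_{1,n} \to \infty$, the appropriate scaling $\varphi_n$ given by
(S1), and the associated semigroup $T_n$. We have, by Dynkin's formula, for
all $f \in C_0^\infty$,

$$T_{n,t} f(y) - T_t f(y) = T_{n,\epsilon} f(y) - T_\epsilon f(y) + \int_\epsilon^t \left[\frac{d}{ds}\left(T_{n,s} f(y)\right) - \frac{d}{ds}\left(T_s f(y)\right)\right] ds.$$

Fix any $\epsilon > 0$. The above calculation shows that, for all $f \in C_0^\infty$ and $\epsilon \le t \le T$,

$$\|T_{n,t} f(y) - T_t f(y)\| \le \|T_{n,\epsilon} f(y) - T_\epsilon f(y)\| + \frac{C}{\gamma_{1,n}} (T - \epsilon), \tag{81}$$

where $C$ is a constant. From the assumption on initial convergence,
$\|T_{n,0} f(y) - T_0 f(y)\| \to 0$ as $n \to \infty$, and by the strong continuity of the
semigroup, the first term can then be made arbitrarily small. Then the
semigroup $T_{n,t} f$ converges uniformly toward $T_t f$ on any bounded time interval.
The theorem is proved.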

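(S2). For the scaling (S2), the proof is similar to that for (S1). From
Proposition 6, the scaling (S2) and (79) for $n = 1$ give, for all $t > 0$ and $f \in C_0^\infty$,
as $\gamma_1 \to \infty$,

$$\lambda_2\, \mathbb{E}[X_t f'(Y_t)] = -\frac{\lambda_2 \gamma_2}{\gamma_1}\, \mathbb{E}[X_t Y_t f''(Y_t)] + \frac{\lambda_2^2}{\gamma_1}\, \mathbb{E}[X_t^2 f''(Y_t)] + \frac{\lambda_2\, \mathbb{E}[h]}{\gamma_1}\, \mathbb{E}[\varphi(Y_t)\, f'(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right).$$

Repeating this iteratively, we obtain from (79) and Proposition 6, for all
$t > 0$ and $f \in C_0^\infty$, as $\gamma_1 \to \infty$,

$$\begin{aligned}
\lambda_2\, \mathbb{E}[X_t f'(Y_t)] &= \frac{\lambda_2\, \mathbb{E}[h]}{\gamma_1}\, \mathbb{E}[\varphi(Y_t)\, f'(Y_t)] + \frac{\lambda_2^2}{\gamma_1}\, \mathbb{E}[X_t^2 f''(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right) \\
&= \frac{\lambda_2\, \mathbb{E}[h]}{\gamma_1}\, \mathbb{E}[\varphi(Y_t)\, f'(Y_t)] + \frac{\lambda_2^2\, \mathbb{E}^2[h]}{2\gamma_1^2}\, \mathbb{E}[\varphi(Y_t)\, f''(Y_t)] + \frac{\lambda_2^3}{2\gamma_1^2}\, \mathbb{E}[X_t^3 f'''(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right) \\
&= \cdots \\
&= \sum_{k \ge 1} \frac{\lambda_2^k\, \mathbb{E}^k[h]}{k!\, \gamma_1^k}\, \mathbb{E}[\varphi(Y_t)\, f^{(k)}(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right).
\end{aligned} \tag{82}$$

Since $f$ is analytic, we have

$$\begin{aligned}
\int_y^\infty \bar{h}(z - y)\, f(z)\,dz &= \int_y^\infty \bar{h}(z - y)\, f(z - y + y)\,dz \\
&= \int_y^\infty \bar{h}(z - y) \sum_{k \ge 0} \frac{(z - y)^k}{k!}\, f^{(k)}(y)\,dz \\
&= \sum_{k \ge 0} \frac{1}{k!}\, \mathbb{E}^k[\bar{h}]\, f^{(k)}(y).
\end{aligned}$$

Therefore, the last summation term in (82) becomes

$$\sum_{k \ge 1} \frac{\lambda_2^k\, \mathbb{E}^k[h]}{k!\, \gamma_1^k}\, \mathbb{E}[\varphi(Y_t)\, f^{(k)}(Y_t)] = \mathbb{E}\left[\varphi(Y_t) \left(\int_{Y_t}^\infty \bar{h}(z - Y_t)\, f(z)\,dz - f(Y_t)\right)\right],$$

where $\bar{h}$ is defined by (52), so that $\mathbb{E}^k[\bar{h}] = (\lambda_2/\gamma_1)^k\, \mathbb{E}^k[h]$.
Now, (78) with $n = 0$ yields

$$\begin{aligned}
\frac{d}{dt} \mathbb{E}[f(Y_t)] &= -\gamma_2\, \mathbb{E}[Y_t f'(Y_t)] + \mathbb{E}\left[\varphi(Y_t) \left(\int_{Y_t}^\infty \bar{h}(z - Y_t)\, f(z)\,dz - f(Y_t)\right)\right] + O\!\left(\frac{1}{\gamma_1}\right) \\
&= \mathbb{E}[A f(Y_t)] + O\!\left(\frac{1}{\gamma_1}\right),
\end{aligned}$$

where $A$ is the generator of equation (74). The rest of the proof is the same
as for the scaling (S1) and is omitted.

(S3). Similar to the proof of (S2) by simply considering the scaling (S3). The
details are omitted.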



□ 



4. From a binary "on-off" model to bursting. In this last section, 
we show how the convergence of generators used in Section 3.3 is applicable 
in other cases. In particular, we show how one can obtain a bursting gene 
expression model of the form (12) from a binary "on-off" model (this has 
also been proved in Crudu et al. (2012) using different techniques). 

The binary on-off model is described by a switching ordinary differential
equation (Mackey and Tyran-Kamińska, 2006)

$$\frac{dx}{dt} = -\gamma x + \xi, \tag{83}$$

where $\xi$ is a dichotomous random process, which can take the values $0$ or
$\lambda > 0$ with switching rates $\alpha(x)$ (from $0$ to $\lambda$) and $\beta(x)$ (from $\lambda$ to $0$). The
evolution equations for densities are

$$\frac{\partial p^0}{\partial t} = \frac{\partial (\gamma x\, p^0)}{\partial x} - \alpha(x)\, p^0 + \beta(x)\, p^1, \tag{84}$$

$$\frac{\partial p^1}{\partial t} = \frac{\partial ((\gamma x - \lambda)\, p^1)}{\partial x} + \alpha(x)\, p^0 - \beta(x)\, p^1, \tag{85}$$

where $p^0(t,x)$ means the density function with $\xi(t) = 0$, and $p^1(t,x)$ the
density function with $\xi(t) = \lambda$.

This model has been proposed by a number of authors to take into account
the stochasticity in the state of the DNA (Peccoud and Ycart, 1995;
Innocentini and Hornos, 2007; Lipniacki et al., 2006; Ramos and Hornos, 2007).
It is also known that the steady-state solution of this model converges to the
steady-state solution of the bursting model when the appropriate scaling is
applied, namely the gene switches very rapidly from on to off ($\beta \to \infty$), and
the synthesis of the gene product is very efficient in the on state ($\lambda \propto \beta$).

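The associated semigroup of (83) acts on functions in $C_0 \times C_0$, and for
any test function $f = (f^0(x), f^1(x)) \in C_0^\infty \times C_0^\infty$, the evolution equation of
the semigroup is

$$\begin{aligned}
\frac{d}{dt} T_t(f) &= \frac{d}{dt} \int_0^\infty \left(p^0(t,x)\, f^0(x) + p^1(t,x)\, f^1(x)\right) dx \\
&= -\gamma \int_0^\infty x\, p^0(t,x)\, \frac{df^0}{dx}\,dx - \int_0^\infty (\gamma x - \lambda)\, p^1(t,x)\, \frac{df^1}{dx}\,dx \\
&\quad + \int_0^\infty \left(\alpha(x)\, p^0(t,x) - \beta(x)\, p^1(t,x)\right) \left(f^1(x) - f^0(x)\right) dx.
\end{aligned} \tag{86}$$

For the sake of simplicity we restrict ourselves to the case where $\beta(x)$
remains constant, and consider the scaling

$$\lambda = b\beta. \tag{87}$$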

We are going to prove the following result 

Theorem 12. Assume the scaling (87), and that $\alpha$ is bounded. For
any sequence $\{\beta_n\}$, denote by $X^n(t)$ the stochastic process determined by
(83). Suppose $\beta_n \to \infty$ as $n \to \infty$, and $\{X^n(0)\}$ converges in distribution
to $X_0$. Then $X^n$ converges in distribution to the one-dimensional stochastic
process $X(t)$ determined by the solution of

$$\frac{dX}{dt} = -\gamma X + N(h(\Delta X), \alpha(X)), \qquad X(0) = X_0, \tag{88}$$

where $h$ is an exponential distribution with mean $b$.
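The exponential burst law has a simple heuristic origin, which the sketch below illustrates by Monte Carlo. It assumes that degradation during a single on period is negligible (a simplification that becomes accurate as $\beta \to \infty$): an on period lasts an exponential time with rate $\beta$, and production runs at rate $\lambda = b\beta$ throughout, so the amount produced is $\lambda\tau$ with $\tau \sim \mathrm{Exp}(\beta)$, whose $i$th moment is $i!\,b^i$. The function names and parameter values are hypothetical.

```python
import random

def sample_burst_sizes(b, beta, n, seed=0):
    """Amount produced in one on period, lam * tau, with lam = b * beta and
    tau ~ Exp(beta). Assumption: degradation during the on period is ignored."""
    rng = random.Random(seed)
    lam = b * beta
    return [lam * rng.expovariate(beta) for _ in range(n)]

def raw_moment(samples, i):
    """Empirical i-th raw moment of the samples."""
    return sum(s ** i for s in samples) / len(samples)

if __name__ == "__main__":
    b = 2.0
    bursts = sample_burst_sizes(b=b, beta=500.0, n=200_000)
    print("first moment ", raw_moment(bursts, 1), "expected", b)
    print("second moment", raw_moment(bursts, 2), "expected", 2 * b ** 2)
```

The burst-size law in this sketch does not depend on $\beta$ once $\lambda = b\beta$ is imposed, which is why the scaling (87) keeps the limiting jump distribution fixed.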

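Proof. The proof follows along very similar lines as in the previous
section. Here we briefly show the calculations.

Consider a function $f$ such that $f^0(x) = f^1(x) = f(x)$, and let

$$p(t, x) = p^0(t, x) + p^1(t, x).$$

Then

$$\frac{d}{dt} \int_0^\infty p(t,x)\, f(x)\,dx = -\gamma \int_0^\infty x\, p(t,x)\, \frac{df}{dx}\,dx + \lambda \int_0^\infty p^1(t,x)\, \frac{df}{dx}\,dx. \tag{89}$$

To calculate the last term, we note for any $g \in C_0^\infty$,

$$\begin{aligned}
\frac{d}{dt} \int_0^\infty p^1(t,x)\, g(x)\,dx &= -\gamma \int_0^\infty x\, p^1(t,x)\, \frac{dg}{dx}\,dx + \lambda \int_0^\infty p^1(t,x)\, \frac{dg}{dx}\,dx \\
&\quad + \int_0^\infty \alpha(x)\, p^0(t,x)\, g(x)\,dx - \int_0^\infty \beta\, p^1(t,x)\, g(x)\,dx.
\end{aligned} \tag{90}$$

When $\beta \to \infty$, one can make a quasi steady-state approximation to (90),
which gives

$$\int_0^\infty p^1(t,x)\, g(x)\,dx = -\frac{\gamma}{\beta} \int_0^\infty x\, p^1(t,x)\, \frac{dg}{dx}\,dx + \frac{\lambda}{\beta} \int_0^\infty p^1(t,x)\, \frac{dg}{dx}\,dx + \frac{1}{\beta} \int_0^\infty \alpha(x)\, p^0(t,x)\, g(x)\,dx. \tag{91}$$

Iterating the process and writing $g = \frac{df}{dx}$, we have

$$\begin{aligned}
\lambda \int_0^\infty p^1(t,x)\, \frac{df}{dx}\,dx &= \sum_{i \ge 1} \left(\frac{\lambda}{\beta}\right)^i \int_0^\infty \alpha(x)\, p(t,x)\, \frac{d^i f}{dx^i}\,dx \\
&\quad - \gamma \sum_{i \ge 1} \left(\frac{\lambda}{\beta}\right)^i \int_0^\infty x\, p^1(t,x)\, \frac{d^{i+1} f}{dx^{i+1}}\,dx - \sum_{i \ge 1} \left(\frac{\lambda}{\beta}\right)^i \int_0^\infty \alpha(x)\, p^1(t,x)\, \frac{d^i f}{dx^i}\,dx.
\end{aligned} \tag{92}$$

The first sum gives the jump kernel

$$\int_0^\infty \alpha(x)\, p(t,x) \left(\int_x^\infty h(y - x)\, f(y)\,dy - f(x)\right) dx,$$

where $h$ is a distribution whose moments are given by

$$\mathbb{E}^i[h] = i! \left(\frac{\lambda}{\beta}\right)^i = i!\, b^i.$$

The other two summations are dominated by

$$\left| \gamma \sum_{i \ge 1} \left(\frac{\lambda}{\beta}\right)^i \int_0^\infty x\, p^1(t,x)\, \frac{d^{i+1} f}{dx^{i+1}}\,dx + \sum_{i \ge 1} \left(\frac{\lambda}{\beta}\right)^i \int_0^\infty \alpha(x)\, p^1(t,x)\, \frac{d^i f}{dx^i}\,dx \right| \le C(f) \int_0^\infty x\, p^1(t,x)\,dx.$$

This last integral goes to $0$ as $\beta \to \infty$ because

$$\begin{aligned}
\frac{d}{dt} \int_0^\infty x\, p^1(t,x)\,dx &= \int_0^\infty (\alpha(x) - \gamma - \beta)\, x\, p^1(t,x)\,dx + \lambda \int_0^\infty p^1(t,x)\,dx \\
&\le (\overline{\alpha} - \gamma - \beta) \int_0^\infty x\, p^1(t,x)\,dx + \lambda \int_0^\infty p^1(t,x)\,dx,
\end{aligned}$$

where $\overline{\alpha}$ is an upper bound of $\alpha$, which allows us to apply Gronwall's lemma and the fact that
$\int_0^\infty p^1(t,x)\,dx \to 0$ to conclude that the claim is true.
Now, substitute (92) into (89). When $\beta \to \infty$, we obtain

$$\frac{d}{dt} \int_0^\infty p(t,x)\, f(x)\,dx = -\gamma \int_0^\infty x\, p(t, x)\, \frac{df}{dx}\,dx + \int_0^\infty \alpha(x)\, p(t,x) \left(\int_x^\infty h(y - x)\, f(y)\,dy - f(x)\right) dx,$$

which is the same as we obtained from equation (88).

The remainder of the proof proceeds as in the proof of Theorem 10 and
the details are omitted. □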